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Asymptotic geometry of the mapping class group 
and Teichmuller space 

Jason A Behrstock 

In this work, we study the asymptotic geometry of the mapping class group and 
Teichmuller space. We introduce tools for analyzing the geometry of "projection" 
maps from these spaces to curve complexes of subsurfaces; from this we obtain 
information concerning the topology of their asymptotic cones. We deduce several 
applications of this analysis. One of which is that the asymptotic cone of the 
mapping class group of any surface is tree-graded in the sense of Druju and Sapir; 
this tree-grading has several consequences including answering a question of 
Drufu and Sapir concerning relatively hyperbolic groups. Another application is 
a generalization of the result of Brock and Farb that for low complexity surfaces 
Teichmuller space, with the Weil-Petersson metric, is <5-hyperbolic. Although for 
higher complexity surfaces these spaces are not (5-hyperbolic, we establish the 
presence of previously unknown negative curvature phenomena in the mapping 
class group and Teichmuller space for arbitrary surfaces. 

20F67; 30F60, 57M07, 20F65 



1 Introduction 

1.1 Overview of main results 

In studying the geometry and dynamics of a surface S, some of the most important 
mathematical objects are its mapping class group AiCQ(S), its Teichmuller space, and 
the complex of curves. We prove new results about the asymptotic geometry of these 
objects through a study of their asymptotic cones. 

Recently, Druju and Sapir showed that any group which is (strongly) relatively hyperbolic 
has cut-points in its asymptotic cone [20]. They asked whether this property of having 
asymptotic cut-points gives a characterization of such groups. We prove the following. 

Theorem 7.1 If 5 is any surface and Cone w (.MC<7(S)) is any asymptotic cone, then 
every point of Cone^iAiCQiS)) is a global cut-point. 
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Similarly one obtains global cut-points in each asymptotic cone of Teichmiiller space 
with the Weil-Petersson metric. 

This result answers the question of [20] in the negative for, as shown by Anderson, 
Aramayona and Shackleton [1] and independently by the author with Druju and Mosher 
[4], the group AiCQ(S) is not hyperbolic relative to any family of finitely generated 
subgroups. This establishes the mapping class groups as the first known examples of 
groups which are not relatively hyperbolic, but whose asymptotic cones are tree-graded. 
See [4] for other applications of Theorem 7. 1 and its Corollary 7.3. 

We obtain the following more specific result on the asymptotic geometry of the mapping 
class group. 

Theorem 6.5 In any surface, S, and any asymptotic cone, Cone tlJ (.MCC/(S')), there 
exists an isometrically embedded IR -tree, Fq C Cone u {M.CQ{S)) with the following 
properties: 

• Fq contains infinitely many distinct infinite geodesies arising as asymptotic cones 
of axes of pseudo-Anosovs; 

• for each pair of points on Fq, there exists a unique topological path in 
Cone^AiCQiS)) connecting them and this path is contained in Fq. 

A similar result holds replacing the mapping class group by Teichmiiller space with the 
Weil-Petersson metric. 

One application of Theorem 6.5 is that it shows that the directions of pseudo-Anosov 
axes are negatively curved, ie, in the direction of each pseudo-Anosov the divergence 
function is superlinear. This readily implies that these directions are quasi-geodesically 
stable, ie, for the quasi-geodesic axis given by taking powers of a pseudo-Anosov, 
there is a bound M(K, C) for which any (K, C)-quasigeodesic with endpoints on this 
path must be completely contained within the M neighborhood of the axis. We also 
establish that the Weil-Petersson metric on Teichmiiller space is negatively curved in the 
directions of geodesies all of whose subsurface projections are uniformly bounded. Thus 
although for higher complexity surfaces the Weil-Petersson metric is not 5 -hyperbolic, 
this result shows that there are directions in which one gets the useful consequences of 
(5-hyperbolicity. (Note that such a result does not follow from the negative sectional 
curvature of the Weil-Petersson metric since these curvatures are not bounded away 
from 0.) Results on stability of certain families of geodesies in the Teichmiiller metric 
was proved by Minsky [46], thus our results complete this analogy between these three 
spaces. (After completing this result, we were informed by Minsky that in current joint 
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work with Brock and Masur they will give an alternate proof of the stability of a similar 
family of Weil-Petersson geodesies.) 

When the complexity of the surface is small enough, we recover the following result 
which was first established by Brock and Farb [16]. (After this work was completed 
another alternate proof of this result appeared using interesting techniques involving the 
geometry of CAT(0) spaces, see Aramayona [2].) 

Theorem 5.1 With the Weil-Petersson metric, the Teichmiiller spaces for the surfaces 
Si t 2 and S ,5 are each 5 -hyperbolic. 

This result implies that any subset of an asymptotic cone of Teichmiiller space of Sip 
or ^0,5 contains a point which disconnects it. By contrast, the asymptotic cone of the 
Teichmiiller space of any surface of higher complexity contains bi-Lipschitz embeddings 
of W with n > 2, these subsets can not be disconnected by any point. Nonetheless, 
Theorem 7.1 implies that each point is a global cut-point; this is one of several ways to 
consider Theorem 7.1 as a generalization of Theorem 5.1. 

The original motivation for this work was to study quasiflats in the mapping class 
group. In particular, it had been conjectured by Brock-Farb that the mapping class 
group admits quasiflats only up to the maximal dimension for which it contains abelian 
subgroups (by work of Birman, Lubotzky and McCarthy [8] the latter number is known 
to be 3g + p — 3). Using Theorem 6.5 as the base case for an inductive argument, 
the author in joint work with Minsky [6] have proven this "Rank Conjecture" (using 
different techniques Hamenstadt [26] has also obtained a proof of this result). 

1.2 Methods 

The above results are produced by analyzing the complex of curves, a simplicial complex 
encoding intersection patterns of curves on a surface. This complex was first introduced 
by Harvey and has since been used to prove many deep results about the mapping class 
group (see Harer [27], Harvey [28], Hatcher [29], and Ivanov [34]). More recently 
Masur and Minsky proved 5-hyperbolicity of the complex of curves [42], and then used 
this to establish new methods for studying the mapping class group [43]. The complex 
of curves has also found many applications to the study of 3-manifolds; in particular it 
has proved crucial in the recent proof of the Ending Lamination Conjecture (see Brock, 
Canary and Minsky [45, 15]). For other applications see: Behrstock and Margalit [5], 
Bestvina and Fujiwara [7], Bowditch [11], Hempel [30], Irmak [33] and Margalit [41]. 
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For each subsurface X C S, we consider a "projection" map 

tt x : MCG(S) -» C(X) 

from the mapping class group of S into the curve complex of X, which is closely 
related to the map taking a curve in 5 to its intersection with X. The vertices of the 
curve complex of a surface, C(X), are homotopy classes of essential, non-peripheral 
closed curves on X. The distance between two distinct homotopy classes of curves, 
[i, v G C(X), is a measure of how complicated the intersection of representatives of 
these curves must be. For example as long as X has genus larger than one: dx(^, v) = 1 
if and only if /j, and v can be realized disjointly on X whereas dx(^, v) > 3 if and only 
if \i and v fill X (fill means that for any representatives of \i and v the set X\(fiU u) 
consists of disks and once-punctured disks). 

The following tool provides the starting point for our analysis. 

Theorem 4.3 (Projection estimates) Let Y and Z be two overlapping subsurfaces of 
S. Then for any pt £ MCG(S): 

d C{Y )(dZ,fi) > M =>■ d C{Z ){dY^) < M. 

Where M depends only on the topological type of the surface S. 

Combined with work of Masur-Minsky [43] this result is useful in studying the metric 
geometry of mapping class groups and Teichmuller space via their subsurface projections. 
(Theorem 4.3 can be generalized to study projections of the mapping class group into a 
product of curve complexes. Doing so yields an interesting picture of certain aspects 
of the asymptotic geometry of the mapping class group, this direction is developed 
further by the author in [3].) Our study of the metric geometry of mapping class 
groups and Teichmuller space is done via an analysis of their asymptotic cones. We 
define a projection map to send elements of the mapping class group of a surface S 
to the mapping class group of a subsurface Z C S. This projection map has several 
nice properties, for instance it sends points to uniformly bounded diameter subsets 
(Lemma 6.1), and is used in defining an important new object introduced in this paper: 
the strongly bounded subset, Fq , of the asymptotic cone. This subset is part of the 
transversal tree of Cone^AiCQiS) and plays a large role in the Behrstock-Minsky 
proof of the Rank Conjecture [6]. 

1.3 Outline of subsequent sections 

In Section 2 we discuss the tools of asymptotic geometry which are fundamental to the 
constructions and results in the rest of this work. All the results in this section are well 



Qeometry & Topology 10 (2006) 



Asymptotic geometry of the mapping class group 



1527 



known and thus we are as much setting up notation as we are reminding the reader of 
which tools to keep at their fingertips for the remainder of this work. 

Surfaces are the main characters throughout this work. In Section 3 we discuss the 
marking complex, which is our quasi-isometric model of choice for dealing with the 
mapping class group. We also remind the reader of various facts we will need concerning 
the mapping class group, the complex of curves, and Teichmuller space. 

The original material in this work begins with the results in Section 4. Here we calculate 
estimates on the image of the "projection" map from the marking complex into the 
product of the curve complexes of the constituent subsurfaces. In this section we 
produce our starting point, Theorem 4.3 (Projection estimates), which is a technical 
result we use in our analysis of the mapping class group. In Theorem 4.4 (Projection 
estimates; geometric version) we provide a geometric interpretation of Theorem 4.3. 

The final three sections apply the Projection estimates Theorem and related techniques to 
various situations. In the first of these we (re)prove hyperbolicity of the Weil-Petersson 
metric on Teichmuller space and of the mapping class group in some low complexity 
cases, Theorems 5.1 and 5.2. The techniques introduced in this section provide a good 
warm-up for those used to prove the key step in its generalization, Theorem 6.5. 

In Section 6 we generalize the hyperbolicity of low complexity mapping class groups 
and Teichmuller spaces to deduce the existence of negatively curved directions in the 
cases of higher complexity surfaces. The main result of this section is Theorem 6.5 
which identifies some new aspects of the geometry of Cone^i-MCG) which we exploit 
in the following section. 

We end with Section 7 where we relate our earlier work to questions about relative 
hyperbolicity in the mapping class group. Here we answer some questions about 
mapping class groups and tree-graded spaces and raise some new question. 
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2 Background I: Large scale geometry 
2.1 Geometric Group Theory 

There are several equivalent definitions for hyperbolicity; throughout this paper we will 
have in mind Rips' thin triangle characterization: 

Definition 2.1 A geodesic metric space X is called 5-hyperbolic if there exists a 
constant 8 so that for each triple a,b,c G X and each choice of geodesies [a, b], [b, c], 
and [a, c] one has that [a, b] is contained in a 5 -neighborhood of the union of [a, c] 
and [b,c]. 

Note that we write [a, b] to denote a geodesic between a and b even though this choice 
need not be unique. 

Definition 2.2 Let (X, dx) and (Y, dy) be metric spaces. A map 4> : X — > Y is called 
a (K, C)-quasi-isometric embedding if there exist constants K > 1 and C > such 
that for all a,b G X 

^d x (a, b)-C< d Y ((f>(a), <P(b)) < Kd x (a, b) + C. 
K 

It is a C)-quasi-isometry if additionally, ^ has the property that every point of Y 
lies in the C-neighborhood of <f){X) . 

When there exists a (K, C)-quasi-isometry <fi : X — > F between two metric spaces X 
and F, we say that they are (K, C) -quasi-isometric and denote this X ^k,c Y ; When 
the choice of constants is not important (as will often be the case), we simply say X and 
Y are quasi-isometric, denoted X ~ Y. 

For geodesic spaces hyperbolicity is a quasi-isometry invariant; note that the hyperbol- 
icity constant may change (see Ghys and de la Harpe [22]). 

In order to apply this definition to groups, we recall the natural left invariant metric on a 
group G with finite generating set S. The word metric on G relative to the generating 
set S is given by dc,s(a, b) = \a~ l b\ where ja" 1 ^! is the smallest number of letters 
needed to represent the group element aT^b 'm terms of letters from S and their inverses. 



Qeometry & Topology 10 (2006) 



Asymptotic geometry of the mapping class group 



1529 



Although this metric depends on the choice of generating set, for finitely generated 
groups, its quasi-isometry type does not. Thus since hyperbolicity is a quasi-isometry 
invariant, the notion of a group being hyperbolic is well defined. 

A basic non-hyperbolic example where we will use the notion of quasi-isometries is 
Z" (with n > 1), which we (easily) observe is quasi-isometric to W. In any metric 
space X one can study quasi-isometric embeddings of W into X, these are called 
quasiflats of rank n. Generalizing the above Euclidean example, one way to quantify 
non-hyperbolicity of a metric space X is by calculating its geometric rank, namely 
the largest rank of any quasiflat in X. Having geometric rank greater than one is an 
obstruction to hyperbolicity, as it forces X to have arbitrarily fat triangles. 

We now record a technical lemma which will be useful in Section 5. The proof is an 
exercise in manipulating thin triangles (see [3] for a proof). When the ambient space is 
clear we use the notation Bg(x) to denote the closed ball of radius 5 around a point x. 

Lemma 2.3 Let Z be a 5 -hyperbolic space and 7 a geodesic in Z. If (f> : Z — > 2 7 
sends each point of Z to the set of closest points on 7, then for each point z £ Z we 
have diam(0(z)) < 45 + 2. 

2.2 Asymptotic Cones 

Ultrafilters and ultraproducts have been used in logic since Godel introduced them in 
the 1930's. Among other applications they can be used to give a slick proof of the 
compactness theorem of first order logic and to rigorize calculus, a la Robinson's non- 
standard analysis. Van den Dries and Wilkie introduced these tools to topologists when 
they reformulated (and slightly strengthened) Gromov's famous Polynomial growth 
Theorem using this language (compare Gromov [23] and van den Dries and Wilkie 
[18]). These techniques streamlined parts of the proof, replacing an iterated procedure 
of passing to subsequences with one ultralimit, and broadened the applicability of 
Gromov's construction, Gromov's original definition of asymptotic cone only applied to 
nilpotent groups whereas the ultrafied definition works for any finitely generated group. 
(For other applications of asymptotic cones in geometric group theory see: Burillo [17], 
Bridson [12], Dru{u[19], Druju and Sapir [20], Kapovich and Leeb [37] Kleiner and 
Leeb [39], Kramer, Shelah, Tent and Thomas [40] and Riley [51].) 

A non-principal ultrafilter on the integers, denoted uj, is a nonempty collection of sets 
of integers with the following properties: 

(1) If S\ G uj and S2 £ w, then S\ n S2 6 uj. 
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(2) If Si C S2 and Si G u, then S2 G w. 

(3) It is maximal in the sense that for each S C Z exactly one of the following must 
occur: SGworZ\SGu;. 

(4) does not contain any finite sets. (This is the non-principal aspect.) 

Since we are only concerned with non-principal ultrafilter, we will simply refer to these 
as ultrafilters. 

An equivalent way to view an ultrafilter is as a finitely additive probability measure, 
with no atoms, defined on all subsets of integers and which takes values in {0, 1}. This 
way of thinking is consistent with the intuition that the ultrafilter sees the behavior on 
only the large subsets of N and when this is done one says that lo -almost every integer 
has a given property when the set of integers with this property is in lo . 

For an ultrafilter lo, a metric space (X, d), and a sequence of points (y,-)i 6 N we define y 
to be the ultralimit of (y/),gpj with respect to lo, denoted y = lim^y,, if and only if for 
all e > one has {/ 6 N : d(yi,y) < e} G lo. When the ultralimit of a sequence exists 
it is a unique point; further, it is easy to see that when the sequence (y,) i( =N converges, 
then the ultralimit is the same as the ordinary limit. It is also worth noting that in a 
compact metric space ultralimits always exist. 

Fix an ultrafilter lo and a family of based metric spaces (Xi,Xi,di). Using the ultrafilter, 
a pseudo-distance on n<eN ^ * s P rov ided by: 

UJ 

The ultralimit of (Xj,jt,-) is then defined to be: 

lim(Xi,Xi) = {y£ TTX; : d u (y, (x,-)) < 00}/ ~, 

UJ 

where for two points y, z G ILeM^' we define y ~ z if and only if d u (y, z) = 0. 

The pseudo-metric d w takes values in [0, oo\ — considering only those points in 
\rm w {Xi,Xi) which are a finite distance from the basepoint restricts our attention to a 
maximal subset where the pseudo-distance does not obtain the value 00 . The relation 
~ quotients out points whose pseudo-distance from each other is zero; in an analogy to 
measure theory we think of this process as identifying sequences which agree almost 
everywhere. These two conditions combine to make d u (y, z) into a metric. 

Just as ultralimits for sequences of points generalize the topological notion of limit, 
ultralimits of sequences of metric spaces generalize Hausdorff limits of metric spaces. 
In particular, for a Hausdorff precompact family of metric spaces the ultralimit of the 
sequence is a limit point with respect to the Hausdorff topology (see [39] for a proof of 
this and other related facts). 
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Definition 2.4 The asymptotic cone of (X, (xi), (dj)) relative to the ultrafilter uj is 
defined by: 

Cone^X, (xi), (</,•)) = lim(X,x,-, — ). 

When it is not a source of confusion we tend to suppress writing the basepoint. Note 
that for for homogeneous spaces (like Cay ley graphs), the asymptotic cone does not 
depend on the basepoint (xi) and thus we shall always take this to be ( 1 ) and drop it 
from the notation. 

When taking such ultralimits, we often use the notation lim^ j(x a ,O a ), when there 
is an implied ambient space T a with basepoint a , in order to emphasize that we are 
looking at an ultralimit in Cone^T^, a ). 

When considering the asymptotic cone, especially for the first time, the following 
quotation from Thomas Hobbes (1588-1679), will likely seem relevant. 

To understand this for sense it is not required that a man should be a 
geometrician or a logician, but that he should be mad. 

Asymptotic cones provide a way to replace a metric space with a "limiting" space which 
carries information about sequences in the original space which leave every compact 
set. This process encodes the asymptotic geometry of a space into standard algebraic 
topology invariants of its asymptotic cone. 

A common use of asymptotic cones is in their relation to hyperbolicity as demonstrated 
by the next result and Theorem 2.7. First we give a preliminary definition. 

Definition 2.5 An R-tree is a metric space (X, d) such that between any two points 
a,b G X there exists a unique topological arc 7 connecting them and 7 is isometric to 
the interval [0,d x (a,b)] C R. 

Proposition 2.6 For a sequence Si — ► , the ultralimit of 5i -hyperbolic spaces is an 
R-tree. 

In particular, if X is a 5 -hyperbolic space, then Cone^X is an R-tree. 

If X is a 5-hyperbolic space which is sufficiently complicated (eg, the cayley graph 
of a non-elementary hyperbolic group) then Cone^X is an R-trees with uncountable 
branching at every point. 

A partial converse to Proposition 2.6 is provided by the following well known result. 
The statement of this theorem first appeared, without proof, in Gromov [24] . A proof 
can be found in Druju [19]. 
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Theorem 2.7 If every asymptotic cone of a metric space X is an R-free, then X is 
5 -hyperbolic. 

The following is an easy observation that simplifies the situation when dealing with 
finitely many equivalence classes of Seq, where Seq is the set of sequences in a 
countable alphabet considered up to the equivalence relation they they agree w-almost 
everywhere. When we consider Seq in the sequel the alphabet will consist of subsurfaces 
of a given surface. 

Lemma 2.8 For any finite set F C Seq, the elements of F are pairwise distinct in Seq 
if and only if there exists a set K G uj for which each 7,7' g T has 7; / 7- for every 
i G K 

Proof Fix a finite set F C Seq. 

(=£•) The maximal clause in the definition of ultrafilter states that for each K C N 
either A , £worN\^ £w. We suppose that for each pair of elements 7,7' £ T we 
have 7^7' and thus the set K of indices where 7,- = 7- must have K G" to . Maximality 
of the ultrafilter then implies that N \ K G uj . Thus for each i G N \ K G u we have 
7i -/7 ( , ,we define N\K = K' J ^. 

Since ultrafilters are closed under finite intersections, the intersection over all pairs 7, 7' 
of K jn / yields a set J G uj where 7,- / 7- for every i G J and every 7, 7' G F. 

(<^=) Let K G uj be the set of indices for which 7, / 7- for each i G K and 7,7' G F . 
Then for any 7,7' we have {/ : 7, = 7-} G N \ K and hence is not in u>. Thus 
7^7. □ 

We end this section with a summary of some standard results about asymptotic cones 
that we will use in the sequel (see Bridson and Haefliger [13], Kleiner and Leeb [39] or 
Kapovich [35]). 

Proposition 2.9 Fix a non-principal ultrafilter uj . 

(1) Cone w (X) is a complete metric space. 

(2) Cone^Xi x X 2 ) = Cone^) x Cone w (X 2 ) 

(3) Cone^R" = R" 

(4) The asymptotic cone of a geodesic space is a geodesic space. 

(5) A (K, C)-quasi-isometry between metric spaces induces a K -bi-Lipschitz map 
between their asymptotic cones. 
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3 Background II: Surfaces and related structures 

3.1 Complex of Curves 

We use 5 to denote a connected, orientable surface of genus g = g(S) with p = p(S) 
punctures. (Note that we will interchangeably use the words puncture and boundary, 
as the distinction does not affect the results contained in this work.) We use the terms 
subsurface and domain to refer to a homotopy class of an essential, non-peripheral, 
connected subsurface of S (subsurfaces are not assumed to be proper unless explicitly 
stated). When we refer to the boundary of a domain this will mean the collection of 
(homotopy classes of) non-peripheral closed curves which bound the domain as a subset 
of 5. 

We now recall the construction of the complex of curves and relevant machinery 
developed by Masur and Minsky which we will use in our study; for further details 
consult [43]. We use £(S) = 3g(S) + p(S) — 3 to quantify the complexity 1 of the surface 
5. Recall that when positive, 3g(S) + p(S) — 3 is the maximal number of disjoint 
homotopy classes of essential and non-peripheral simple closed curves which can be 
simultaneously realized on S. Naturality of £(S) as a measure of complexity is justified 
by the property that it decreases when one passes from a surface to a proper subsurface 
(recall our convention of considering surfaces up to homotopy). Since our interest is in 
hyperbolic surfaces and their subsurfaces, we only consider subsurfaces with £ > —2 
(thus ignoring the disk and the sphere). Additionally, as it is not a hyperbolic surface 
nor does it appear as a subsurface of any hyperbolic surface we will usually ignore Sift 
(although much of our discussion has analogues for this case); thus £ = is used only 
to denote the thrice punctured sphere. 

Introduced by Harvey [28] to study the boundary of Teichmuller space, the complex 
of curves has proven to be a useful tool in the study of Teichmuller space, mapping 
class groups, and 3 -manifolds. The complex of curves is a finite dimensional complex 
which encodes information about the surface via the combinatorics of simple closed 
curves. Analysis using the curve complex is necessarily delicate since the complex is 
locally infinite except in a few low genus cases. 

Definition 3.1 The complex of curves for S, denoted C(S) , consists of a vertex for every 
homotopy class of a simple closed curve which is both non-trivial and non-peripheral. 

'In [43] they use £(S) = 3g(S) + p(S), but we use the current version because it has better 
additive properties when generalized to disconnected surfaces, as we need in Section 6. 
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The N-simplices of C(S) are given by collections of N + 1 vertices whose homotopy 
classes can all simultaneously be realized disjointly on S. 

This definition works well when £(S) > 1 , however it must be modified slightly for the 
surfaces of small complexity, which we refer to as the sporadic cases. The two cases 
where £(5) = 1 are S^i and So,4 (the following discussion works for S^q as well); here 
any two distinct homotopy classes of essential simple closed curves must intersect (at 
least once in S^i and twice in So,4)- m these cases the above definition of C(S) would 
have no edges, only vertices. Accordingly, we modify the definition so that an edge is 
added between two homotopy classes when they intersect the minimal amount possible 
on the surface (ie, once for Si ; \ and twice for So,4)- With this definition C(Si,i) and 
C(5o,4) are each connected, indeed they are each isometric to the classical Farey graph. 

When £(S) = 0, —2, or —3 then C(S) is empty. The final modification we make is for 
the case where we have A C S with £(A) = — 1, the annulus. The annulus doesn't 
support a finite area hyperbolic metric, so our interest in it derives from the fact that 
it arises as a subsurface of hyperbolic surfaces. Indeed, annuli will play a crucial 
role as they will be used to capture information about Dehn twists. Given an annulus 
A C S, we define C(A) to be based homotopy classes of arcs connecting one boundary 
component of the annulus to the other. More precisely, denoting by A the annular cover 
of S to which A lifts homeomorphically, we use the compactification of H 2 as the 
closed unit disk to obtain a closed annulus A. We define the vertices of C(A) to be 
homotopy classes of paths connecting one boundary component of A to the other, where 
the homotopies are required to fix the endpoints. Edges of C(A) are pairs of vertices 
which have representatives with disjoint interiors. Giving edges a Euclidean metric of 
length one, in [43] it is proven that C(A) is quasi-isometric to Z. 

The following foundational result, which is the main theorem of [42], will be useful in 
our later analysis. For another proof see Bowditch [9], which provides a constructive 
computation of a bound on the hyperbolicity constant. 

Theorem 3.2 (Hyperbolicity of C(S) [42, 9]) For any surface S, C(S) is an infinite 
diameter 5-hyperbolic space (as long as it is non-empty). 

Throughout this paper we use the convention that "intersection" refers to transverse 
intersection. Thus for example if we consider a subsurface Y C S and an element 
7 6 dY which is not homotopic to a puncture of S then we consider the annulus around 
7 to not intersect Y. 
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For a surface with punctures one can consider the arc complex C'{S) , which is a close 
relative of the curve complex. When £(S) > — 1 we define the vertices 2 of C'(S) 
(denoted C' (S)) to consist of elements of Co (5) as well as homotopy classes of simple 
arcs on S with endpoints lying on punctures of S, which don't bound a disk or a once 
punctured disk on either side. As done for the curve complex, we define 7V-simplices 
of C'(5) to be collections of N + I vertices which can simultaneously be realized on 
the surface as disjoint arcs and curves. For annuli we define C'(A) = C(A). 

The arc complex arises naturally when one tries to "project" an element 7 £ C(5) into 
C(F) where F C S. When £(F) > we define 

A: C (S)^2 c o^ 

by the following: 

• If 7 n Y = 0, then define 7^(7) = 0. 

• If 7 C Y, then define 7^(7) = {7} . 

• If 7 Pi dY 7^ 0, then after putting 7 in a position so it has minimal intersection 
with dY we identify parallel arcs of 7 n Y and define 7^(7) to be the union of 
these arcs and any closed curves in 7 n dY / . 

In the last case, it follows from the definition of C'(Y) that 71^(7) is a subset of C (F) 
with diameter at most one. Thus whenever 71^(7) / 0, it is a subset of C'(Y) of diameter 
at most one. Moreover, when £(S) > 0, there is a map 4>y : C' (Y) — > 2 Co< - Y \ which 
implies that the arc complex embeds into the curve complex as a cobounded subset — the 
map (py sends each arc to the boundary curves of a regular neighborhood of its union 
with dY. For Y C S with £(F) > we define Try = 4> Y o 7T y : C(5) -> C(F) (this map 
is actually to the power set of C(F), but for the sake of readability and since the map is 
to bounded sets, we abuse notation and simply write C(Y)). 

When £(Y) = —I any curve 7 which crosses Y transversally has a lift 7 E Y with at 
least one component which connects the two boundary components of Y. Together, the 
collection of lifts which connect the boundary components form a finite subset of C'(Y) 
with diameter at most 1, define 7Tf(7) to be this set. If 7 doesn't intersect Y or is the 
core curve of F, then define 7ry(7) = 0. Also, for consistency we define 7Ty : v — ► {v} 
forveCo(F). 

2 This definition differs from that given by Harer [27] where an arc complex is considered 
consisting of only arcs and thus does not contain C(S) as a subcomplex. Our definition agrees 
with that in [43]. 
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Since we often work with subsets of C(Y) rather than points, we use the following 
notation. For a set valued map / : X — > 2 Y and a set A C X we define /(A) = Li x eAf( x )- 
If X C S and /i, 1/ G C(S) we adopt the notation 

dxO, ^) = dcQQi^xili), 7rx(7)). 

Further, given sets A,Be C(5) we set dx(A,B) = mm{d x (a, (3) : a £ A and /3 G S}. 
Also, we write diamx(A) to refer to the diameter of the set nc(x)(A) and diamx(A, B) 
for diamc(x)(A U B) in order to emphasize the symmetry between our use of minimal 
distance and diameter. (These same conventions apply when considering markings, as 
defined in the next section.) 

An extremely useful result concerning these projections is the following (see [43] for 
the original proof, and [45] where the bound is corrected from 2 to 3): 

Lemma 3.3 (Lipschitz projection [43]) Let Y be a subdomain of S. For any simplex 
p in C(S), ifirrip) + then diamy(p) < 3. 

In light of this lemma, for any pair of domains Y C Z we consider C(Z) \ B\(dY) 
with the path metric, ie, the distance between 7,7' G C(Z) \ B\{dY) is the length 
of the shortest path connecting them in C(Z) \ B\{dY). With this metric there is a 
coarsely Lipschitz map -kz~*y '■ C(Z) \ B\(dY) — > C(Y), where a map is defined to be 
(K, C)-coarsely Lipschitz if and only if there exist a pair of constants K, C such that for 
each a,b G C(Z) \ B\(dY) we have Jy(7Tz^y(a), 7rz^y(^)) < Kd C (z)\Bi(dY)( a -,b) + C. 
By the above lemma and the fact that ttz^y sends points of C(Z) \ B\{dY) to subsets 
of C(Y) of diameter at most 3, we have the following which we shall use in Section 4. 
(See Figure 1 for a cartoon of the behavior of this map near dY .) 

Corollary 3.4 For any domain Y C Z, endowing C(Z) \ Bi(dY) with the path metric, 
we have 

vr z ^y: C 1 (Z)\B 1 (8Y)^C(Y) 
is coarsely Lipschitz (with constants K = 3 and C = 3). 

In Figure 1, we observe that although 7Tz^y is Lipschitz, pairs of points like 71 and 
75, although distance 2 in C(Z), can be found which are arbitrarily far apart in the path 
metric on C(Z) \ Bi(dY) and thus their distance when projected to C(Y) axe also made 
arbitrarily large. 

Letting Ci(5) denote the one-skeleton of C(S), we state the following theorem which is 
one of the key technical results from [43]. 
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Theorem 3.5 (Bounded geodesic image [43]) Let Y C S with £(Y) / and let g be 
a geodesic segment, ray, or line in Ci(S), such that 7ry(v) ^ for every vertex v of g. 
There is a constant D depending only on £(S) so that dianiy(g) < D. 



3.2 Markings 

In this section we describe the quasi-isometric model we will use for the mapping class 
group and explain the tools developed in [43] for computing with this model. 

Definition 3.6 A marking, n , on S is a collection of base curves to each of which 
we (may) associate a transverse curve. These collections are made subject to the 
constraints: 

• The base curves, base(^) = {71 , . . . , 7«} , form a simplex in C(S). 

• The transverse curve, t, associated to a given base curve 7 is either empty or an 
element of Co(S) which intersects t once (or twice if S = Sq^) and projects to a 
subset of diameter at most 1 in the annular complex £(7). 

When the transverse curve t is empty we say that 7 doesn't have a transverse curve. 
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If the simplex formed by base( / u) in C(S) is top dimensional and every curve has a 
transverse curve, then we say the marking is complete. 

If a marking has the following two properties, then we say the marking is clean. First, 
for each 7 € base p its transversal t is disjoint from the rest of the base curves. Second, 
for each 7 and t as above, t U 7 fills a surface denoted F(t, 7) with £(F(t, 7)) = 1 and 
in which dc(F(t,-y))(t, 7) = 1 ■ (See Definition 3.2.) 




7i 



Figure 2: A complete clean marking € .M(Si,2) with base(^i) = {71 , 72} 

Let fj, denote a complete clean marking with pairs (7,-, £(7;)), we take as elementary 
moves the following two relations on the set of complete clean markings: 

(1) Twist: For some i, we replace (7,-, ?(7,-)) by (7,, ^(7,)) where ^(7,) is the result 
of one full (or half when possible) twist of £(7,-) around 7;. The rest of the pairs 
are left unchanged. 

(2) Flip: For some i we swap the roles of the base and transverse curves between 7; 
and ?(7,-) . After doing this the complete marking may no longer be clean, one 
then needs to replace the new marking with a compatible clean one. 

We say the clean marking p, is compatible with p! if they have the same base curves, 
each base curve 7 has a transverse curve t in one marking if and only if it has a 
transverse curve t 1 in the other marking, and when t exists then d^(t, t 1 ) is minimal 
among all choices of t' . 

In [43] it is shown that there exists a bound (depending only on the topological type of 
S) on the number of clean markings which are compatible with any other given marking. 
Thus even though the flip move is defined by choosing an arbitrary compatible complete 
clean marking, it is canonical up to some uniformly bounded amount of ambiguity. 

One then defines the marking complex, denoted M.{S), to be the graph formed by taking 
the complete clean markings on S as vertices and connecting two vertices by an edge 
if they differ by an elementary move. It is not hard to check that Ai(S) is a locally 
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finite graph and that the mapping class group acts on it cocompactly and properly 
discontinuously. A consequence of this which we use throughout this work is: 

Corollary 3.7 [43] Ai(S) is quasi-isometric to the mapping class group. 

For any subsurface Y C S with C(Y) ^ we considered in the previous section the 
subsurface projection, iry : C(S) —> 2 C(r) . More generally one can consider subsurface 
projections from the marking complex; since this definition generalizes the above 
map we also denote it iry : A4(S) — > 2 C(Y K When fi G M(S) we define iry(p) to be 
7ry(base(^)) unless Y is an annulus around a curve 7 G base(^x), in which case we 
define 7Ty(/i) = 7Ty(?) where t is the transverse curve to 7. An important observation is 
that when p is a complete marking on S then TTy(p) 7^ for each Y C S. 

There is also a Lipschitz projection property for projections of markings. 

Lemma 3.8 (Elementary move projection [43]) If p, pf G M(S) differ by one 
elementary move, then for any domain Y C S with £(F) 7^ 0, 

dy(p, p') < 4. 

3.3 Hierarchies 

Thurston's classification theorem for surface homeomorphisms [53] gives a layered 
structure for homeomorphisms based upon studying subsurfaces which are preserved 
(perhaps under an iterate of the homeomorphism). In [43], Masur and Minsky refine 
and further elucidate a layered structure for the mapping class group. Using the marking 
complex and the complex of curves, they provide a way to compare how relatively 
complicated two given mapping class group elements are on any subsurface (not just 
ones that are eventually periodic!). Towards this end, they introduce an object called a 
hierarchy which ties all the subsurface comparisons together. 

Definition 3.9 Let Y C S with £(7) > 1 . A sequence of simplices Vo, vi , . . . , v„ is 
called tight if: 

(1) For each < i < j < n, and vertices v- C V; and vj C vj, 

driy'^v'j) = \i-j\. 

(2) For each < i < n we have that v,- is the boundary of the subsurface filled by 
V/-1 U V/+1. 
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When £(Y) = 1 we consider a sequence to be tight if and only if it is the vertex sequence 
of a geodesic. 

When £(Y) = — 1 we consider a sequence to be tight if and only if it is the vertex 
sequence of a geodesic where the set of endpoints on dY of arcs representing the 
vertices equals the set of endpoints of the first and last arc. 



We often use the following decorated version of a tight sequence. 



Definition 3.10 A tight geodesic g in C(Y) consists of a tight sequence vo, vi . . . , v„ 
and a pair of markings I = 1(g) and T = T(g) (called the initial and terminal markings 
for g) such that vo is a subset of the simplex base(I) and v n is a subset of base(T). 

The integer n is called the length of g. The domain (sometimes called the support) of g 
refers to the surface Y, written D(g) = Y. 

Below we explain a relationship between tight geodesies occurring in different subsur- 
faces. 

When p, £ M(S) we write /j,\y to denote the restriction of this marking to Y, by which 
we mean: 

. If £(*0 = -l,then/x| y = 7ry(/x). 

• Otherwise, we let /u| y be the set of base curves which meet Y essentially, each 
taken with their associated transversal. 

For a surface Y with £(F) > 1 and a simplex v C C(Y), we say that X is a component 
domain of (Y, v) if either: X is a component of Y \ v or X is an annulus whose core 
curve is a component of v. More generally, we say that a subsurface X C Y is a 
component domain of g if for some simplex v,- in g, X is a component domain of 
(D(g), vt). Notice that v; is uniquely determined by g and X. 

Furthermore, when X is a component domain of g and £(X) ^ 0, define the initial 
marking ofX relative to g: 



KX,g) = 



Vi-i\x Vi is not the first vertex (of g) 
1(g) I x vt is the first vertex 



Similarly define the terminal marking ofX relative to g to be: 
T{X,g) = 



v/+i \x Vi is not the last vertex 
T(g)|x Vi is the last vertex 
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Observe that these are each markings, since dX is distance 1 in C(S) from each of vy-i , 
or in the case where v, is the first vertex, then dX is disjoint from base(I(g)) (similarly 
for the terminal markings). 

When X is a component domain of g with T(X, g) ^ , then we say that X is directly 
forward subordinate to g , written X ^ g . Similarly, when l(X, g) ^ we say that X is 
directly backward subordinate to g, written g ~/ X. 

The definition generalizes to geodesies as follows. 

Definition 3.11 Let g and h be tight geodesies. We say that g is directly forward 
subordinate to h, written g"4^ h, when D(g) ^ h and T(g) = T(D(g),h). Similarly, 
h is directly backward subordinate to g, written g ^ h, when g ^ D(h) and 
I(fc) = I(£>(70,g). 

We write forward subordinate, or \, to denote the transitive closure of ^ ; similarly 
we define / . 

We can now define the main tool which was introduced in [43]. 

Definition 3.12 A hierarchy (of geodesies) H, on S is a collection of tight geodesies 
subject to the following constraints: 

(1) There exists a tight geodesic whose support is S. This geodesic is called the main 
geodesic and is often denoted gn ■ The initial and terminal markings of gn are 
denoted 1(H) and T(H). 

(2) Whenever there exists a pair of tight geodesies g,k G H and a subsurface Y C S 
such that g Y ^ k then H contains a unique tight geodesic h with domain Y 
such that g $s h^ k. 

(3) For each geodesic h G H other than the main geodesic, there exists g, k G H so 
that g^ h^i k. 

Using an inductive argument, it is proved in [43] that given any two markings on a 
surface, there is a hierarchy which has initial marking one of them and terminal marking 
the other. The process begins with picking a base curve of the initial marking, one in 
the terminal marking, and a geodesic in C(S) between them (the main geodesic). Then, 
the second condition in the definition of a hierarchy forces certain proper subdomains 
to support geodesies: if there is a configuration g ^ D "4, k such that D does not 
support any geodesic h with 1(h) = I(D, g) and T(h) = T(D, k), then we construct such 
a geodesic. When this geodesies is constructed, we can choose the initial vertex to be 
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any element of baseI(F, g) (similarly for the terminal vertex). This process continues 
until enough geodesic are included in H so that the second and third conditions are 
satisfied. As the above suggests, there is in general not just one hierarchy, but many 
of them connecting any pair of markings. In our proof of Theorem 4.3 (Projection 
estimates) we will exploit this flexibility by building our hierarchy subject to certain 
additional constraints which we find useful. 

When we consider several hierarchies at once, we use the notation gnj to denote the 
geodesic of H supported on Y, when this geodesic exists it is unique by Theorem 3.13. 

For any domain Y C S and hierarchy H, the backward and forward sequences are given 
respectively by 

£-(Y) = {b g H : Y C D(b) and I(b)\ Y + 0} 

and S+(F) = {feff:7C D(b) and T(f)\ Y 0}. 

The following theorem summarizes some results which are useful for making computa- 
tions with hierarchies. 

Theorem 3.13 (Structure of Sigma [43]) Let H be a hierarchy, and Y any domain in 
its support. 

(1) If £#(F) is nonempty then it has the form of a sequence 

fo ^ • ■ • ^ fn = gH, 
where n > 0. Similarly, if ^(Y) is nonempty then it has the form of a sequence 

gH = b m d / ■■■ d / b Q , 

where m > . 

(2) If S^(F) are both nonempty and £(F) / 0, then bo =fo, and Y intersects every 
vertex offo nontrivially. 

(3) If Y is a component domain of a geodesic k G H and £(F) ^ 0, then 

feT,+(Y) ^ Y\f, 

and similarly, 

beZ H (Y) ^ b/Y. 

If, furthermore, T 1 ± {Y) are both nonempty, then in fact Y is the support of the 
geodesic bo =fo- 

Qeometry & Topology 10 (2006) 



Asymptotic geometry of the mapping class group 



1543 



(4) Geodesies in H are determined by their supports. That is, if D(h) = D(h') for 
h,ti G H then h = ti . 

Given a hierarchy, the following provides a useful criterion for determining when a 
domain is the support of some geodesic in that hierarchy. 

Lemma 3.14 (Large links [43]) If Y is any domain in S and 

d Y (I(H), T{H)) > M 2 , 

then Y is the support of a geodesic h in H, where Mj only depends on the topological 
type of S. 

The geodesies in a hierarchy admit a partial ordering, which generalizes both the linear 
ordering on vertices in a geodesic and the ordering coming from forward and backwards 
subordinacy. Below we recall the basic definitions and a few properties of this ordering. 

Given a geodesic g in C\(S) and a subsurface Y, define the footprint of Y in g, denoted 
4> g (Y) to be the collection of vertices of g which are disjoint from Y. It is easy to see 
that the diameter of this set is at most 2 and with a little work it can be shown that the 
footprint is always an interval (of diameter at most 2). The proof that 4> g (Y) is interval 
uses the assumption that the geodesic is tight: this result is the only place where the 
tightness assumption on geodesies is utilized. That footprints form intervals is useful as 
it allows one to make the following definition. 

Definition 3.15 For a pair of geodesies g,k G H we say g precedes k in the time order, 
or g -< t k, if there exists a geodesic m G H so that D(g) and D(k) are both subsets of 
D(m) and 

max c/) m (D(g)) < mm4> m (D(k)). 
We call m the comparison geodesic. 

Time ordering is a (strict) partial ordering on geodesies in a hierarchy, this is proven in 
[43]. It is worth remarking that when a pair of geodesies are time ordered, they are time 
ordered with respect to a unique comparison geodesic. 

The following provides a way to use the time ordering to gain information about the 
hierarchy. 

Lemma 3.16 (Order and projections [43]) Let H be a hierarchy in S and h,k G H 
with D(h) = Y and D(k) = Z. Suppose that Y 'HZ / and neither domain is contained 
in the other. Under these conditions, if k -< t h then dy(dZ, 1(H)) < M\ + 2 and 
dz(T(H), dY) < M\ + 2. The constant Mi only depends on the topological type of S. 
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Since we will often use the above hypothesis, we introduce the terminology that a pair of 
subsurfaces Y and Z of S overlap when Y n Z / and neither domain is contained in 
the other. A application of this lemma which we shall often use is: given h\ , fi2, k G H 
with D(hj) = Z, and D(ifc) = Y and for which /z,- -« f it, then d Y (dZi,dZ 2 ) < 2{M\ + 2). 

The next result provides a way to translate distance computations in the mapping class 
group to computations in curve complexes of subsurfaces. 

Theorem 3.17 (Move distance and projections [43]) There exists a constant t(S) 
such that for each p,, v G M. and any threshold t > t(S), there exists K(= K(t)) and 
C{= C(0) such that: 

—d M (fj,,v)-C< ^2 d Y {ii,v)<KdM(P;V) + C. 

YCS 
dy(ll,v)>t 

The importance of Theorem 3.17 can not be understated, as this theorem provides the 
crucial result that hierarchies give rise to quasi-geodesic paths in the mapping class 
group. We now indicate this construction. 

Given a hierarchy H, [43] define a slice in H to be a set r of pairs (h, v), where h G H 
and v is a vertex of h, satisfying the following conditions: 

(1) A geodesic h appears in at most one pair of r. 

(2) There is a distinguished pair {hn,v T ) appearing in r, where Iih is the main 
geodesies of H . 

(3) For every (k, w) G r other than (Iih, v r ), D(k) is a component domain of (D(h), v) 
for some (h, v) G r. 

(4) Given (h, v) G r, for every component domain Y of (D(h),v) there is a pair 
(k, w) G r with D(jfc) = F. 

To any slice r is associated a unique marking as follows. All the vertices which 
appear in geodesies of non-annular subsurfaces are distinct and disjoint, so these form a 
maximal simplex of C(S) which we take to be base(/i r ). Then, by the forth condition, 
r must contain a pair of the form (hy,t) for each 7 G base(/i r ). Letting t be the 
transversal for 7 we obtain a complete marking, which we denote \i T . This marking 
may not be clean, but a lemma of [43] implies that a complete clean marking can be 
found in a uniformly bounded neighborhood; thus we assume fj, T G Ai(S). 

3 This is called a complete slice in [43]. 
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The set of slices admits a partial ordering (relating to the time ordering discussed 
above), which then descends to provide a partial ordering on the markings p T for a 
given hierarchy H. In [43] it is shown that slices nearby in the slice ordering yield 
markings which are close in Ai(S); combined with 3.17 these sequences of markings 
form quasi-geodesic paths in M(S), called hierarchy paths. 

3.4 Teichmuller space and the pants complex 

For a detailed reference on Teichmuller space, consult Imayoshi and Taniguchi [32]. 

Definition 3.18 For a fixed open topological surface 5, the Teichmuller space of S 
is the space of equivalence classes of pairs (X,f) , where X is a finite area hyperbolic 
surface and /: S — > X is a homeomorphism. A pair (Xi ,/i) and (X2J2) are considered 
equivalent if there exists an isometry h : ^1^X2 such that h of 2 is homotopic to/i . 

A topology is obtained on Teichmuller space by infimizing over the distortion of maps 
h : X\ — ► X2 ■ Topologically Teichmuller space is fairly easily understood, as the 
following classical result indicates: 

Theorem 3.19 Teichmuller space is homeomorphic to M. 6g ~ 6+2p . 

There are several natural metrics on Teichmuller space and its metric structure is far 
less transparent than its topological structure. Often of interest is the natural complex 
structure which Teichmuller space carries. What will be important in the sequel is the 
Weil-Petersson metric on Teichmuller space, which is a Kahler metric with negative 
sectional curvature. We will not need the integral form of the definition, so we just 
mention that the metric is obtained by considering a natural identification between the 
complex cotangent space of Teichmuller space and the space of holomorphic quadratic 
differentials, then defining the Weil-Petersson metric to be the one dual to the L 2 -inner 
product on the space of holomorphic quadratic differentials. See Wolpert [54] for a 
survey on the Weil-Petersson metric and its completion. 

Definition 3.20 A pair of pants is a thrice punctured sphere. A pants decomposition 
of a surface S g>p is a maximal collection of pairwise non-parallel homotopy classes of 
simple closed curves; this decomposition obtains its name from the observation that 
such a curve system cuts S gJ) into 2g + p — 2 pairs of pants. 
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It is easy to verify that any pants decomposition consists of exactly 3g + p — 3 disjoint 
simple closed curves on S and is thus a maximal simplex in C(S). These decompositions 
have long been useful in the study of the mapping class group and Teichmiiller space. 

Originally defined by Hatcher [29], the pants complex of S, denoted V(S), is a way of 
comparing all possible pants decompositions on a fixed surface. This complex consists of 
a vertex for every pants decomposition and an edge between each pair of decompositions 
which differ by an elementary move. Two pairs of pants P = {71 , . . . , 73 g + P -3} and 
P' = {7j , . . . , 73g +p _3} differ by an elementary move if P and P' can be reindexed so 
that both: 

(1) 7( . = 7 ; for all 2 < i < 3g +p - 3 

(2) In the component T of S \ U2<!<3g+ p -37;- which is not a thrice punctured sphere 
(T is necessarily either a once punctured torus or a 4 punctured sphere) we have 
(see item 3.4): 




Figure 3: Elementary moves in the pants complex 

V(S) is metrized by giving each edge the metric of the Euclidean interval [0, 1] . Our 
interest in this space comes out of the following remarkable theorem of Brock: 

Theorem 3.21 [14] V(S) is quasi-isometric to the Teichmiiller space of S with the 
Weil-Petersson metric. 

Remark 3.22 Noticing that a marking without transverse data is just a pants decom- 
position, in Section 8 of [43] it is remarked that all of the constructions in their paper 
using hierarchies to obtain results about the marking complex can be replaced with 
analogous theorems for the pants complex. This is done by replacing markings (and 
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hierarchies, etc) by markings without transverse data (and hierarchies-without-annuli, 
etc). We will often implicitly use this result to note that results we prove from the 
mapping class group also hold in Teichmuller space (in particular, this allows one to 
obtain Teichmuller space analogues of the results in Sections 5 and 6). The following is 
the main result of this form which we will explicitly use. 

Theorem 3.23 (Move distance and projections for V(S) [43]) There exists a constant 
t(S) such that for each /i, v € V (and any threshold t > t{S) ) there exists K(= K(t)) 
and C(= C{f)) such that: 

—d-p(u., v) - C ^2 d,Y(n, v) < Kd-p(/i, v) + C. 

non-annular YCS 

d Y (p,M)>t 

4 Projection estimates 

In this section we address the following question: given a pair of subsurfaces X\,X2 
of S, what is the image of M.(S) under the map which projects markings into the 
product of curve complexes of these subsurfaces, ie, the image of M.(S) as a subset of 
C(X\) x C(X2)1 We answer this in Theorems 4.3 and 4.4 using hierarchies in the curve 
complex as our main tool (see Section 3.1 for background). The geometric description 
underlying Theorem 4.4 can be generalized to arbitrary finite collections of subsurfaces 
by introducing an notion called a coarse tree-flat and using a delicate inductive step, 
this analysis is carried out in [3]. 

We begin this section with a pair of toy examples which illustrate the dichotomy of 
Theorem 4.3 (Projection estimates), but which can be proven without use of the full 
hierarchy machinery. In Section 4.2 we prove Theorem 4.3 and its geometric analogue 
Theorem 4.4. 

4.1 A motivating example: the dichotomy 

Here we give two computations on the genus three surface. We exhibit a dichotomy 
between the behavior of projection maps into disjoint and intersecting pairs of subsur- 
faces. These examples illustrate the geometric meaning of Theorem 4.3 and provide a 
useful warm up for its proof, as the arguments we use here are a particularly easy form 
of what in general is a technical argument involving hierarchies. 
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Example 4.1 (Disjoint subsurfaces) Let S be a closed genus three surface and let 
X and Y be subsurfaces of S which are each once punctured tori and which can be 
realized disjointly on S (Example 4.1). 



X 




Y 



Figure 4: S and two disjoint subsurfaces 

Let a G C(X) and (3 S C(Y). Note that both these curves can also be considered as 
elements of C(S), then since X n Y = it follows that dc(s)(a, f3) = I. Thus a and (3 
are vertices of a common top dimensional simplex fi C C(S) . Taking a complete clean 
marking y! £ A^S) with base(/z') = jti.we have that the map 

.M(S) -» C(X) x C(F) 

is onto. 

The next example provides the more interesting half of the dichotomy. 

Example 4.2 (Overlapping subsurfaces) Let S be the closed genus three surface 
and consider two subsurfaces X and Y which each have genus two and one puncture. 
Moreover, suppose that X and Y overlap in a twice punctured torus, Z. Letting n be an 
element of C(S) which intersects both X and Y. We shall show that both dx(y, BY) 
and dy(fi, dX) cannot simultaneously be large. 



X 




Figure 5: S and two overlapping subsurfaces 

Let g be a geodesic in C\(S) connecting [i to dX, with vertices vo, vi , . . . , v„ where 
vq = /j, and v n = dX. Since g is a geodesic in C(S), the curve v n -\ may be disjoint 
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from X, but for all k < n — 1 we have v# n dX ^ and in particular, irxiyid an d 
7rr(vyt) 7^ 0- Since 5 = X U F we have either v„_i n Y ^ or v„_i C X. 

If v„_i n Y ^ 0, then every vertex of g intersects Y non-trivially. Theorem 3.5 
(Bounded geodesic image [43]) states that if g is a geodesic in Ci(S) with every vertex 
non-trivially intersecting a subsurface Y C S, then diamy(7Ty(g)) < K for a constant A' 
depending only on the topological type of S. Accordingly, this theorem implies that we 
have dy(y, dX) < K for a constant K depending only on the genus of S. 

If v„_i fi y = , then it must be the case that v„_i C X. In this case, we have v^nZ / 
for each < k < n — 1 . Applying the Bounded Geodesic Image Theorem tells us that 
dx(y, v n _i) < K. Since v M _i D y = 0, we have in particular that v M _i n 3y = which 
implies that dx(v n -i, dY) < 3. Thus we may conclude that dx(n, BY) < K + 3. 

Putting the two cases together, this example shows that there exists a constant C = K+3 , 
for which any /i G C(5) must satisfy either dx([i, dY) < C or cfy(/i, 5X) < C. 

If we take fi to be one of the base curves for some complete clean marking v of S, we 
have shown that for any marking v <E M.(S) either 

d x (v, dY) < C or d Y (y, dX) < C. 

We consider this a toy example, since the only tool we needed was the Bounded geodesic 
image Theorem. Difficulties in generalizing this example to arbitrary overlapping 
subsurfaces include problems such as: the boundaries of these subsurfaces may fill S or 
the union of these subsurfaces may not be all of S — both properties that were crucial 
to the above argument. To deal with these problems we use a powerful technical tool 
called a hierarchy, in which one considers not just a geodesic in the curve complex of 
5, but a family of geodesies each in the curve complex of some subsurface of 5. The 
tools introduced by Masur and Minsky for calculating with these allow one to give the 
argument in the next section which is obtained by bootstrapping the key idea used in 
the above example. 

4.2 Projection estimates theorem 

Theorem 4.3 (Projection estimates) Let Y and Z be two overlapping surfaces in S, 
with £(Y) + / £(Z), then for any y. e M(S) : 

d Y (8Z, fi)>M => d z (dY, ff) < M, 

for a constant M depending only on the topological type of S. 
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Proof Let M\ and M2 be given by Lemmas 3.14 and 3.16 (Large link Lemma and 
Order and projections Lemma [43]). Define: 

M = max{Mi + 2,M 2 + 3}. 

We now fix a point G M for which dy([i, dZ) > M and will show that this implies a 
bound on <fe(M> 9Y). 

Consider a hierarchy H with base(7(//)) D 7Tz(/x) U <9Z and = (when Z is an 
annulus take base(/(//)) = dZ with transversal vrz(/x)). Furthermore, we assume // is 
built subject to the following two constraints. (See Section 3.3 for a discussion of the 
choices involved when building a hierarchy.) 

• Choose the initial vertex of gn to be an element of dZ which 
intersects Y . Call this vertex vo. 

• When adding a geodesic to H with a given initial marking I(Y, g), if 
Vo G base(/(F, g)) choose this as the initial vertex for the geodesic. 
If vo base(/(F, g)), then if any elements of dZ are in base(/(F,g)) 
choose one of these to be the initial vertex for the geodesic. 

The proof is now broken into four steps. We show there exists a geodesic k G H with 
domain Z and a geodesic h G H with domain Y. Then we show k -< t h, from which 
the theorem follows as a consequence of Lemma 3.16 (Order and projections [43]). 

Step (i) There exists a geodesic k with domain Z. 

We start by considering the forward and backward sequences, £#(Z) and S^(Z), as 
denned in the discussion preceding Theorem 3.13 (Structure of Sigma [43]). 

Since [i is complete we have 7Tz(/x) 7^ ; thus both the initial and terminal markings 
restrict to give nontrivial markings on Z (when Z is an annulus, we are using that 1(H) 
has a transverse curve). This implies for any subsurface Q^_Z that 11^(0 and S^(® 
both contain gu and thus in particular are nonempty. Theorem 3.13 shows that when Q 
is a component domain of a geodesic in H and both S^(0 and £^(<2) are non-empty, 
then Q must be the support of a geodesic in H. 

Since the first vertex of gu is vo G dZ, we know it has a component domain Q\ C S 
which contains Z. By the above observation, we know that £^(<2i) and ^(Qi) are 
both non-empty, and thus Q\ supports a geodesic which we call k\ . If Q\ =Z we have 
produced a geodesic supported on Z and are done, otherwise since Q\ 2 Z we can 
choose an element of dZ as the first vertex of the geodesic supported in Q\ . 
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Starting from the base case Qo = S and ko = gH, the above argument produces a 
sequence of properly nesting subsurfaces S = Qo 2 Qi 2 • • • 2 Qn = Z where for 
each i > the subsurface <2, is a component domain of a geodesic i G H supported 
in Qi-\ and each of the fe,- has an element of SZ as an initial vertex. 

Since S is a surface of finite type, the above sequence of nested surfaces must terminate 
with Z after finitely many steps. 

Thus Z is a component domain of a geodesic in S and supports a geodesic which we 
call k. 

Step (ii) There exists a geodesic h with domain Y . 

By hypothesis we have dy(dZ, u) > M 2 + 3 and thus: 

d Y (I(H), T{H)) = d Y {7r z (fi) U 8Z, n)>M 2 . 

Together with Lemma 3.14 (Large link Lemma [43]), this implies Y is the support of a 
geodesic h in H. 

Step (iii) k -< t h 

In this step we will prove that the geodesic k precedes h in the time ordering on 
geodesies in H. 

Since D(k) = Z and D{h) = Y and each are contained in D{gu) = S: if max0 gH (Z) 
< min(fig H (Y), then k -< t h which is what we wanted to prove. In the general case, 
we provide an inductive procedure to show that these geodesies have the desired time 
ordering. 

We refer to the ordered vertices of g as v,(g) ■ Recall that by the first part of the constraint 
we have Vo(g#) = Vo, where vq was chosen to satisfy vo G dZ and vo fl Y ^ 0. Also, 
since gn is a tight geodesic, v\(gH) = dF(vo(gH), V2(?#)) where F(a, [3) denotes the 
surface filled by a and 0. 

Summarizing, we have: 

• Y n v / , and thus v 4> gH (Y) . 

• Z is contained in a component domain of vo, thus vo G 4> gH {Z). 

• Since gH is a geodesic, we know V2(g#) must intersect vo(G dZ). Since the 
diameter of a footprint is at most 2, it now follows that max 4> gH (Z) < vi(gH)- 
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Together these imply: 

max4> gH (Z) < vi(g H ) and v x (g H ) < min</> to (F). 
Now there are two mutually exclusive cases to consider: 

(1) v 1 (g H )nZ^0orv 1 (g H )nF^0 

(2) Vl (g H )nz = $ = v 1 (g H )nY. 

In the first case, depending on which of the two sets is non-empty, we obtain v\(gH) g" 
4> gH (Z) or vi(gn) G" <p gH (Y), respectively. Either of these imply max0 ?H (Z) < 
min (j) gH (Y), proving that in case 1 we have k -< t h. 

Observing that in 5i,i and So,4 the footprint of a domain must consist of at most one 
vertex, we have v\(gu) G" </>g w (Z) and thus when S is either of these surfaces we are in 
case 1 and thus k -< t h. As the annulus does not contain any pair of surfaces which 
overlap we may now assume for the rest of the proof that £(S) > 1 . In particular, for 
the remainder of the argument we need not consider the sporadic cases where C(S) has 
a special definition and thus we can assume two distinct homotopy classes of curves 
have distance one in the curve complex if and only if they can be realized disjointly. 

For the remainder of this step we assume that vi(g#) G <fig H (Z) and v\(gn) G <l>g H (Y) 
and will prove that we still obtain k -< t h. Since v\(gji) H Z = there is a component 
of S \ v\(gn) which contains Z and since Z n Y / the same component contains Y as 
well — we will call this component W\ . Since W\ is a component domain of H which 
intersects both 1(H) and T(H) (since W\ D Z), Theorem 3.13 (Structure of Sigma) 
implies that it supports a geodesic l\ . 

Since W\ is a component domain of vi(g#), we have vo G I{W\,gH)- Thus, by our 
convention for choosing geodesies in H, we choose vq(/i) =vq. As before we have 
v (/i) G 4> h (Z), v 2 (/i) 0/,(Z), and v (/i) <j> h (Y). If vi(Zi) is in both <j> h {Z) and 
0/,(y) then we again restrict ourselves to the appropriate component W2 of W\ \ v\(l\) 
and repeat the argument with W2 and lj . This gives a properly nested collection 
of subsurfaces each of which contains Y U Z, so the process terminates in a finite 
number of steps to produce a geodesic /„ with domain W n 3 Y U Z and which satisfies 
vo(ln) = v G 0/„(Z), v 2 (l n ) 0/„(Z), v (/„) 0/„(F), and either vi(l n ) </>,,, (Z) or 
vi(/,i) <fti„(Y)- Hence max^)/„(Z) < min0/ n (y), and we have k -< t h with comparison 
geodesic /„. 

Step (iv) Conclusion 

We have now produced a hierarchy H with geodesies k and /z with D(h) = Y, 
D(k) = Z, and k -< t h. Thus Lemma 3.16 (Order and projections [43]) implies that 
d z (dYJ(H)) <Mi +2. 
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Since T(H) = fj,, this implies dz(dY, fi) < M\ + 2 which is exactly what we wanted to 
show. □ 

It would be interesting to have a more elementary proof of Theorem 4.3, although the 
given argument is useful in that it provides the reader an introduction to the tools we 
need for the later parts of this work. C Leininger has suggested to me ideas for a more 
constructive proof. 

4.2.1 Geometry of projection estimates 

The main case of the following result is an immediate corollary of Theorem 4.3 
(Projection estimates); we give this restatement because it emphasizes the geometry 
underlying that result. 

Theorem 4.4 (Projection estimates, geometric version) For any distinct subsurfaces 
Y and Z of S, exactly one of the following holds for the map 

vry x vrz : M(S) C(Y) x C(Z) 

(1) Y n Z = in which case the map is onto. 

(2) One of the surfaces is contained inside the other, say Z D Y. Then the image is 
contained in a radius 3 neighborhood of the set 

C{Y) x irz(dY) (J Graph(7r z _y), 

where ttz^y is the projection map from C(Z) \ B\(dY) to C(Y). Furthermore, 
7Ty x -Kz(M(S)) is 6 -dense in this set. 

(3) Y and Z overlap, in which case the image is contained in a radius M = 
max{Mi + 2,M2} neighborhood of the set 

ir ¥ (dZ) x C(Z) (J C(Y) x TT Z (dY). 

The constant M depends only on the topological type of S. Furthermore, 
tty x -Kz(Ai(S)) is 6 ■ M -dense in this set. 

Proof 

Casel FnZ = 

Any pair consisting of a curve in C(Z) and a curve in C(Y) can be completed to a 
complete marking on S so the map is onto. 
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Case 2 Y C Z 

For any /i G M we have that dz{n' z {ii), ttz(/-0) < 1 an d thus when we restrict these 
arcs (curves) to Y we get that ^(vryTr^/u), tt' y ttz(^)) < 1- Applying Lemma 3.3 
(Lipschitz projection [43]), we have dyifp^Y 71 ^^^ ^Y^zi^)) < 3. Thus we have 

d Y (lJ.,iTz^Y ° vr z (/i)) < 3. 

This proves that if Y C Z then 7ry x irz(M-) is contained in a radius 3 neighborhood of 
C(F) x 7T Z (dY) U Graph(vr z ^ F ). 

We now prove the density statement. Let (7, 5) G Graph(7Tz^y) C C(F) x C(Z). Then 
take any [i <E M.(S) which has 5 G base(/i). By Lemma 3.3 we have nz(n) C Bt,(5). 
By hypothesis we have 7 G vry(5), so Lemma 3.3 also shows 7Ty(/i) C -83(7). Thus we 
have (7, 5) C B(,(tty(^) x 7Tz(/i)). Similarly, given any 7 G C(F), since Y C Z we have 
7 and <9K are disjoint. Thus if fi G _M(S) is chosen so that both 7 and <9K are in base(fi) , 
we have 7r z (» C 63(7) and 7Ty(/x) C 63(7). Thus (<y,iTz(dY)) C B 6 (tt ¥ (v) x 7r z (/i)). 
Thus we have shown that C(Y) x 7rz(5F) |J Graph(7rz^y) C B(,(ity x 7rz(.M)). 

Case 3 Y n Z, F £ Z, and Z £ Y 

If ^ G .A/US) projects to C(Y) with d Y (dZ,fi) > M then Theorem 4.3 (Projection 
estimates) implies that d z (dY, //) < M. Thus the image of the map M(S) -> C(F) x C(Z) 
is contained in the union of the radius M neighborhood of TTy(dZ) x C(Z) with the 
radius M neighborhood of C(Y) x 7rz(<9F) as claimed. 

Given any 7 G C(Y), Lemma 3.3 implies that diam z (7r z (7), ir z (dY)) < 3. If fj, G -M(S) 
is chosen so that both 7 and <9F are in base(/i), then we have 7Tz(/u) C B^zidY)) and 
7ry(/^) C 83(7). Thus (7,vrz(3F)) C B^ityi^) x 7Tz(/u)). Thus we have shown that 
C(F) x 7Tz(0F) C B(,(tty x 7Tz(7W)). The identical argument with the roles of Y and Z 
reversed completes the proof of the density statement. □ 

Remark 4.5 In the above proposition we see that except in the case where Y n Z = , 
there are uniform bounds for which the image of the map from C(S) to C(Y) x C(Z) 
lies in a neighborhood of the 5-hyperbolic space formed from the complexes of curves 
for Y and Z "joined together." The two complexes are glued together along a bounded 
diameter set in the non-nesting case. In the case of nesting C(Y) is glued to the link 
of a point in C(Z); essentially this is done by taking a "blow-up" of C(Z) at the point 
dY G C(Z). 5-hyperbolicity in the nesting case follows from a more delicate analysis, 
see [3]. 

Theorem 4.4 (Projection estimates, geometric version) is summarized in Figure 6. For a 
(slightly) more accurate picture of the behavior in the nesting case, see also Figure 1 . 
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znr=0 zny^0 Fez 

non-nested 

Figure 6: A cartoon of the possible images of projections from complete markings into 
C(Z) x C(Y) 

5 Hyperbolicity of the mapping class group and Teichmuller 
space in the cases of low complexity 

In this section we give new proofs of the following theorems. 
Theorem 5.1 V{S\^) and V(Sq : 5) are each 8 -hyperbolic. 
Theorem 5.2 M.(S\ t i) and A^(5o,4) are each 5 -hyperbolic. 

Theorem 5.2 is classical, since M.(Si^) is isomorphic to SL2CZ) and M-iSo^) to a 
finite subgroup of SL2(Z), hence both group are virtually free and thus easily seen to be 
5 -hyperbolic. Theorem 5.1 is a much deeper theorem, and was originally proved by 
Brock-Farb in [16] (see also [2]). Although the Weil-Petersson metric has been known 
for some time to have negative curvature, it was only recently discovered in [31] that 
the sectional curvature is not bounded away from zero, even for the case of ViSi^) 
and "P(5o,5), thereby prohibiting a proof of Theorem 5.1 by a comparison geometry 
argument. 

We find especially tantalizing the phenomenon that these two results about two very 
different spaces can be proven simultaneously. This can be done since the property 
of low complexity which we use is that any two proper subsurfaces (with nontrivial 
curve complex) must overlap: a property of S\ t i and So,4 which is also true for 5^2 
and 5o.5 when one adds the assumption that the subsurfaces are not annuli, a natural 
assumption when one is considering pants decompositions instead of markings (cf. 
Theorem 3.23). Thus the proof of Theorem 5.1 is obtained by rereading the proof for 
Theorem 5.2 reading "marking" as "pants decomposition," "hierarchy" as "hierarchy 
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without annuli," and Ai(S) as V(S). The core arguments in this section will be revisited 
to prove Theorem 6.5, a result which holds for surfaces of arbitrary complexity. 

Our method of proof is to show that for every hierarchy path there exists an "almost 
locally constant" map sending the marking (or pants) complex to the hierarchy path. 
That this implies hyperbolicity is a consequence of a general result from [42] ; we also 
provide a new proof of this implication using asymptotic cones (Theorem 5.7). 

5.1 A projection from markings to hierarchies 

Fix a surface S, a pair of complete clean markings / and T, and a hierarchy H connecting 
them. By a hierarchy path, we mean a "resolution" of slices of the hierarchy into a 
sequence of markings separated by elementary moves; this is done by taking the slices 
of H and interpolating between them to get a path. The choice of a hierarchy path is not 
canonical, but they are each (K, C)-quasi-geodesics with the quasi-isometry constants 
depending only on the topological type of the surface. (See [43] , especially Section 5 
and the Efficiency of hierarchies Theorem of Section 6.) 

We say that Y appears as a large domain in H if Y is in the set 

G = {Y C S : d Y (I(H), T(H)) > 6M + 45'}. 

Where M = max{Mi + 2, M2 + 3} is the constant coming from Theorem 4.3 (Projection 
estimates), note in particular that this constant is sufficiently large to satisfy the hypothesis 
of Lemma 3.14 (Large link [43]) and 5' = 45 + 5 where 5 denotes the maximum of the 
5 -hyperbolicity constants for C(Y) where Y C S. Alternatively, when dealing with the 
pants complex we consider: 

Q' = {Y G Q : Y is not an annulus}. 

Letting gjj >Y denote the geodesic segment supported on Y in the hierarchy H, for 
each Y G Q we have a map Ai(S) C(Y) y where r Y is the closest point(s) 

projection from C(Y) to gn,y- We denote the composition 

p Y = r Y o vry : M(S) -> g H) y, 

or just p when the surface Y is understood. Notice that (^-hyperbolicity of C(Y) (via 
Lemma 2.3) combined with the fact that diamy(7Ty(^)) < 3 (Lemma 3.3) imply that 
for any \i G Ai(S), the set py(fJ-) has diameter at most 5' . Accordingly, it follows that 
maps py are coarsely distance decreasing in the sense that: 

(5-1) dc(y)(pY(v),PY{v)) < dciYMrifJ-), ^y(^)) + 26'. 

Define C(p) = {Y G Q : d Y (p Y (fJ-), T(Y)) < 3M + 25'} 

and TKjm) = {Y G Q : d Y (p Y (p,),I(Y)) < 3M + 25'}. 
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Usually the choice of marking p, will be fixed and we will drop the fi from the notation, 
just writing C and 1Z. 

The next lemma summarizes some basic facts we will need about these sets. 

Lemma 5.3 (1) £nft = 

(2) Let Y, Z £ Q. If Y -< t Z and Y and Z overlap, then either YeCorZGK. 

(3) If S is Si t \ or So, a then the set Q \ (X U 1Z) consists of at most one surface. 

(4) If S is S\ t 2 or So,5 then Q'\(£Li 1Z) consists of at most one surface. 

Proof 

(1) If Y € C then d Y (p(p),T) < 3M. If Y is also in K then dr(p(p),I) < 3M. 
But together these imply that the total distance H travels through Y is less than 6M 
contradicting our assumption that Y is a large domain. 

(2) Suppose not. Then we have Y < t Z, Y C, and Z 1Z. 

Since Y and Z are large domains with Y -< t Z, by Lemma 3.16 we have: 

d Y (dZ, T(H)) < Mi + 2 and d z (dY,I{H)) <M { +2. 
Y £ implies that 

drW),T(X)) > 3M + 25'. 

Similarly, Z ^ TZ implies that 

d z (p(v),I(Z))>3M + 25'. 

Putting these facts together with the result from [43] that for any domain 
d D (I(D),I(H)) < Mi , yields the two inequalities: 

d Y (p(n), dZ) > 3M + 25' — M — (Mi +2) > M + 25' 

and dz(p(/Jd, dY) > 3M + 25' - M - (M l + 2) > M + 25'. 

The fact that p Y and /?z are coarsely distance decreasing implies, via Equation 5-1, 
that both d Y (p, dZ) > d Y (p(p), dZ) -25' > M and d z (n, dY) > d z (p(p), dY) - 25' > 
M. But this is impossible, since together these inequalities contradict Theorem 4.3 
(Projection estimates). 

(3) This follows from part 2, since in each of these surfaces any two subsurfaces 
overlap and thus are time ordered by Lemma 4.18 of [43]. 

(4) Again this follows from part 2, since here any two non-annular surfaces must 
overlap. □ 
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Parts 3 and 4 of the above proposition suggests the following construction, which we 
give for the marking complexes for 5i,i and So,4 and also for the pants complexes 
of Si t 2 and So,5 (aspects of this construction generalize to work on arbitrary surfaces, 
we discuss this further in Section 6). Given any pair of markings I,T € M.(S) and 
a hierarchy H connecting them let H denote the set of markings associated to slices 
of H. Below we construct a map $ which maps elements of A4(S) to (uniformly) 
bounded diameter subsets of H (in the cases we are considering, H is metrized by 
a linear time ordering on slices). H be embedded into A4(S) via a quasi-isometric 
embedding which can be extended by resolving the set of slices into a path (referred to 
as a hierarchy path), the map $ induces a map to (uniformly) bounded diameter subsets 
of any hierarchy path from / to T. The map $ is defined to be the identity on markings 
in H; the following defines the map for p, G M.(S) \ H: 

(1) ug\(CUK) = {A} / 0, then define = (A,p A (p)). 

(2) If Q \ (C UK) = 0, then consider 

A = {v£g H : d C ( S ){v, n) < dc(S)(w, fj) for any w G g H }, 

ie, the set of points on the main geodesic gn which are closest to p, namely 
rs(fi). &(p) is defined to be the following sets of markings. 

(a) If any of the surfaces in C are component domains of gu at a vertex in the 
set A, then denote by L the rightmost (the last to appear with respect to the 
time ordering) domain in C, and define ^(/i) to contain each (L, v) where 
v is any vertex of gH,L within 3M + 35' of the terminal marking T. 

(b) If a component domain corresponding to a vertex of gH in the set A is an 
element of 1Z then choose the leftmost element of this set (call this R). 
Define <£(/i) to contain each of the markings (R, v) where v is any vertex 
of gH,R within 3M + 35' of the initial marking /. 

(c) For each domain D which is a component domain of a vertex in A and 
which is not in Q , if D supports a geodesic and it is time ordered after the L 
geodesic and before the R geodesic then define <3?(^) to contain each (D, v) 
where v is any vertex in gup • 

Before proceeding, we mention why (A,pa(^)), (L,v), and (R,v), although not 
necessarily complete clean markings, give rise to such (in the text we abuse notation and 
drop this correspondence from the discussion, thereby simply referring to (A , pa(h)) , etc, 
as complete clean markings). In each case these consist of a proper subsurface X and a 
point in C(X) (we unify the argument that these give markings by writing (X,px(^)))- 
For S^i and So,4, a point of M(S) consist of a curve in C(S) and a transversal to 
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that curve, so it is clear that our prescription above indeed describes a marking since 
the only nontrivial proper subsurfaces are annuli so (X,px(p)) refers to the marking 
(dX , pxifJ.)) ■ The case of V(S) for the surfaces 5^2 and So,5 requires (only slightly) 
more justification. In these cases an element of V(S) is a pair of disjoint curves. Recall 
that the only subsurfaces of S with nontrivial curve complex are once punctured tori and 
four punctured spheres (we ignore annuli when dealing with the pants complex). Thus 
PxilA S C(X) is also an element of C(S). Furthermore, px(p) and dX have distance 1 
in C(S), so (X,px(p>)) is taken to refer to the pants decomposition (dX,px(p)). 

Lemma 5.4 There is a uniform bound D (depending only on the topological type of 
S ), so that for each p G M. the set &(p) has diameter less than D . 

Proof First note that the set of points on the main geodesic gH which are closest to p 
has diameter at most 5' by Lemma 2.3. When Q \ (£ U 1Z) ^ there is only one such 
surface, as proven in Lemma 5.3. In this case it follows from the definition of and 
(5-hyperbolicity of C(A) that &(p) is a subset of H of diameter at most 5' . 

Note that under <1> the marking p can not project to anything time ordered before the 
rightmost element L G C as then Lemma 3.16 would force pl(p) to lie near the initial 
marking of L and then we would have either L G" Q or L G 1Z, either way contradicting 
Lemma 5.3 which proves that every surface in C is time ordered before every surface 
of 1Z; a similar argument gives the analogous result for 1Z. 

So now ^(p) consists of the rightmost element of C which we call L, the leftmost 
element of 1Z which we call R , and all the rest of the small domains supporting geodesies 
which are time ordered between L and R (of which there are at most 5' ). $ was defined 
to be the union of the three set (any of which are possibly empty): 

(1) If L G A then (L, v) where v is any vertex of gn,L within 3M + 35' of the 
terminal marking T. 

(2) If R G A then (R, v) where v is any vertex of gn,R within 3M + 35' of the initial 
marking /. 

(3) If D G A, D £ Q, and L -< t D -< t R then (D, v) where V is any vertex of g HjD . 

First, note that the diameter of the elements in item 1 is at most B(3M + 35'), where B is 
the Lipschitz constant for the map from hierarchy slices to paths in the marking complex. 
(Similarly for the markings in item 2.) Since C(S) is <5-hyperbolic diamc(s)(A) < 5' , 
and thus if both L and R are in A then dc(s)(L, R) < 5' . Also note that for any D G A 
as described in item 3 we have that the diameter in Ai(S) of the set of (D, v) is less 
than B(6M + 45') (since 6M + 45' is the threshold for being in Q). Again, since the 
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diameter in C(S) is bounded by 5' we then see that the set of all markings in item three 
has diameter in the marking complex bounded by 5' ■ B(6M + 45') . 

So now it follows that although ^(/j.) consist of many slices: 

diam M(S) ($(fi) < 2B(3M + 35' + 5'(6M + 45')). □ 

Note that since the map from slices to hierarchy paths is Lipschitz, the above lemma 
yields a map which sends points of M to uniformly bounded diameter subsets of the 
hierarchy path, where the constants are independent of the hierarchy path. 

In the next subsection we will show that these maps to hierarchy paths are "almost 
locally constant" off the hierarchy path, in the sense that there are uniform constants, so 
that large diameter balls around markings far from a hierarchy path are sent to small 
diameter subsets of the hierarchy path. 

5.2 Coarsely contracting projections 

We now describe the tool we use for proving <5-hyperbolicity. It is a generalization to 
our context of Morse's Lemma on stability of quasi-geodesics in hyperbolic space. 

Definition 5.5 In a space X, we say a family of paths H is transitive if every pair of 
points in X can be connected by a path in H . We say H has the coarsely contracting 
property when H is a transitive family of paths in X with the property that for every 
H G H there exists a map : X — ► H and constants b and c such that each for each 
H, p! e X satisfying d x (n, y!) <b- d x (n, ^h(^)), then diam($ ff (^), $ H (^')) < c. 

If considering M(S) then fix S to be either S^i or Sq,4, if considering V(S) then fix S 
to be either Si^ or S^s and use hierarchies- without-annuli. 

Lemma 5.6 

• When S is either S^i or Sq,4 , then the hierarchy paths form a coarsely contracting 
family of paths on M.(S). 

• When S is either S\ : 2 or S 5 , then the hierarchy-without-annuli paths form a 
coarsely contracting family of paths on V(S). 
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Proof There exist a hierarchy connecting any pair of points in M(S), which shows 
that the set of hierarchy paths form a transitive path family (similarly in V(S) for 
hierarchy-without-annuli paths). Fix H a hierarchy path between two points in A4(S) 
and use $ to denote . 

We need to show: there exist constants b and c so that if satisfy djv{(\i, //) < 

b ■ dM(v, $(^))» then diam($(/j), $(//)) < c. 

First notice that a key property of is that there exists a constant Q = 3M + 35' 
so that every domain A of a geodesic in H (including the main surface 5) has 
diam J 4( < I ) (/u) U Pa(ij)) < Q \ this fact is a step in the proof of Lemma 5.4. 

Fix a hierarchy G from ^(/u) to [i. Let Z be a domain of G of length at least 4Q, an 
assumption which implies that: 

(5-2) dzbiMf*)) > 3Q + M. 

In order to compute d^vt($(/x), <)?(//)) it is useful to calculate rfy(//, //) for each large 
domain F of //. Consider a subsurface Z. If Z is a large domain of H, then 
the definition of <£, Lemma 3.16, and Equation 5-1 together imply that dzifi, dY) > 
d z (fJ,,^(n))-5M-3d' . If Z is not a large domain of H, then d z (dY,$(fi)) < 6M+45'; 
applying the triangle inequality then gives dz(/i, dY) > dzin, ^KaO) — diamz($(^)) — 
d z {Hv\dY)-25>. 

Combining the two cases of the previous paragraph, for any surface Z, independent of 
whether or not it is a large domain in H, we have: 

(5-3) dzfa, dY) > d z (n, *0*)) - 3Q. 

Combining the above with inequality 5-2 we have that dz(fJ>, dY) > 3Q+M — 3Q = M, 
which by the Projection estimates Theorem implies that dy(fJ., dZ) < M. 

Now consider dzi.ii! , dY): either dz(y! , dY) > M for every large domain Y of H or for 
some domain Y of // we have dzifi' , dY) < M. 

In the first case, from Theorem 4.3 (Projection estimates) we have dy(n' , dZ) < M, 
which combines with the paragraph above to give dy(fJ.', fi) < 2M. At which point it is 
easy to check that there is a uniform bound on d_\4(&([i), &(fi>')) — the bound comes 
since dyi^ 1 ' ,n) < 2M implies that the pairs (£(/u), TZ(p-)) and (C(fx'), 7£(//)) agree 
(except for a switch of at most one surface which then must have diameter not much 
larger than the threshold for Q) and that A(/i) is coarsely the same as A(//). Carrying 
out this computation, one obtains ^/m^C^KmX < 10M + 65' . 
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In the other case, when there is some domain Y of H for which dz(//, BY) < M, we 
can combine this with equation 5-3 to give: 

(5-4) d z (», //) > dziii, S(m)) -3Q-M. 

This tells us that whenever dzi&ifi), A 4 ) > 2(3(2 + we have 

(5-5) dzin, A 4 ') > ^^zC^Cm), A 4 ) 

Thus, in domains Z of G for which dz(^(fi), /x) is sufficiently large, either: 
dz(^',dY) > M, and thus <£(//) and $(^0 are close in the markings complex, or 
dzifJ,, n') is a definite fraction of dz(&(n), A 4 )- 

If we can show that a definite proportion of the distance travelled in M. between 
and fx, say i^Jh^Ml j takes place in domains larger than a threshold 6<2 + 2M 
then we would be done by choosing the constant b so that b < d M(vMn)) xhis 
would be enough, as then fjf has no chance to move far enough away from \i to have 
dz(^', 9Y) < M, where Z is a large domain in G and Y is a large domain in H. 

That there exists a uniform constant a > for which the distance ^mO^Km)) occurs in 
large surfaces follows from the following counting argument. The fact that the only 
domains of S which support geodesies in H are the surface S and component domains 
of gH has as a consequence that if a hierarchy between / and T contains geodesies 
in N + I different component domains, then dc(s)U, T) > N. From this we see that 
either idyvf(S)($(A0) A 4 ) occurs in large domains, or the sum of the lengths of geodesies 
shorter than 6Q + 2M is larger than ldM(S)(^(l^), A 4 )- m the latter case, we then have 

\sh\ > d ^Q+2M) ] ■ Thus we always have that at least 2 (6Q+2M) d M(S)(<S>(v), A 4 ) occurs 
in the large surfaces, as claimed. 

Thus, choosing b = 2 (6q+2M) anc ^ c > + 6<5', we have proved the theorem. □ 



5.3 Hyperbolicity 

In this section we prove Theorems 5.2 and 5.1 using the contracting properties of 
hierarchy paths provided by Lemma 5.6. 

In [42] a contraction property is defined similar to Definition 5.5 and is used to prove 
5 -hyperbolicity of C(S); that property motivated the definition of the previous section. 
In this section, using results about R -trees we give a new proof that such contraction 
properties imply hyperbolicity, using techniques different than those of [42]. This 
proof provides a new perspective concerning the relationship between stability of 
quasigeodesics and hyperbolicity. 
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Theorem 5.7 If X is a geodesic space with a family of (K, C) -quasi-geodesic paths 
TL which have the coarsely contracting property, then X is 6 -hyperbolic. 

Theorems 5.2 and 5.1 follow from this theorem since Lemma 5.6 proves that in the low 
complexity cases M(S) (and V(S)) the hierarchy paths form a coarsely contracting 
family of paths (respectively hierarchy- without- annuli paths), and [43, Efficiency of 
hierarchies Theorem] implies that hierarchy paths are (K, C)-quasi-geodesics, with the 
constants depending only on the topological type of the surface. 

Proof The method of proof is to show that in the asymptotic cone $ induces a locally 
constant map to a certain family of bi-Lipschitz paths, we then show that this implies 
that Cone u (X, (di)) is an M-tree. In particular we will show for each pair of points 
x,y G Cone^X, (di)) there exists a map $ : Cone^X, (di)) — ► [x,y] such that: 

( 1 ) ^ | [x,y] is the identity 

(2) *|ctane w (X,<4))\[*,y] is locally constant. 

By our hypothesis, for each 77 G TL there exists a map : X — > X such that: 

(1) For each /iGXwe have <£//(/i) G 77. 

(2) There exist constants b, c > such that for any sequence of paths H n and 
points fi n for which r n = dxiUn^Hi/J-n)) grows linearly, then 
diam z ($ ff „(B & . rn (/i„))) < c. 

Fix three points I = (fyies, T = (7/),-^, Jx = (/i,-),eN £ Cone^X, (di)). Let H denote 
the path connecting 7 to t given by a rescaled sequence of paths Hi connecting 7, to 
T{ and assume p, 77 . Since TL has the coarsely contracting property, by definition 
we know that when dxifa, Hi) grows linearly in dj (say it grows roughly as g ■ di for 
a constant g) the ball with radius b • dx(pi,Hi) around /x, maps to a ball of radius 
at most c in 77,-. Thus in the rescaled metric space, X • X, we have d±. x ([ii,Hj) is 

approximately g and in this metric the ball 6/,.grf, (/i() maps to a set of diameter j in 77,-. 

Thus diam Comw ^x,ldi))((^(^bg(fJ'i)))) — nm a; j = 0> thereby showing that the map is 
locally constant. 

This proves that any embedded path connecting 7 and T must be a subset of 77. Since 
by hypothesis each 77 G TL is a (^T, C)-quasi-geodesics, we have that 77 is the ultralimit 
as i — > cc of (K, ^)-quasi-geodesics and is thus a TT-bi-Lipschitz embedded path. 
In particular it is homeomorphically embedded, so the only subpath connecting its 
endpoints is the path itself — thereby proving that there is a unique topological arc 
between any pair of points in Cone^(X). It is worth remarking that since Cone^X) is 
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a geodesies space, the path H which a fortiori need only be bi-Lipschitz embedded is 
actually the image of the geodesic between I and t . 

Mayer and Oversteegen's topological characterization of IR -trees [44] implies that in 
order to prove that Cone w (X) is an M-tree it suffices to show that it is uniquely arc wise 
connected and locally path connected. The first property was shown above, the second 
is true for any geodesic space. 

Thus each Cone^(X) is an M-tree and thus Theorem 2.7 implies that X is 5 -hyperbolic. 

□ 



6 Asymptotic cones of MCQ 

When £(S) > 2, the mapping class group of S is not hyperbolic and thus the results 
of the previous section clearly do not hold for arbitrary surfaces (similarly for the 
Teichmuller space of 5 when £(S) > 3). Nonetheless, in this section we show that 
negative curvature phenomena is present in certain directions. We begin by building a 
projection to mapping class groups of subsurfaces which we use to construct a subset 
which we call the strongly bounded subset, Fq, of the asymptotic cone. Theorem 6.5 is 
the main result of this section, it describes various topological properties of the subset 
Fq. The proof of Theorem 6.5 is similar in spirit to the proof of Theorems 5.1 and 5.2. 
The main difference, is that due to the lack of 5-hyperbolicity in the spaces being 
considered in this section, we have no hope of showing that there is a coarsely locally 
constant projection to every hierarchy path. We consider a subcollection of hierarchy 
paths for which we build such projections; it the ultralimit of this collection of path that 
yield the set Fq . 

Given any subsurface Z C 5, we define a projection ttm(Z) '■ -M(S) — > 2 M( - Z \ which 
sends elements of M.{S) to subsets of M.(Z). Given any /i € M.{S) we build a marking 
on Z in the following way. Choose an element 71 £ ttz(^) , and then recursively choose 
7„ from 7r z \u, < „7,(/ i )> ^ or eacn n — Now take these 7/ to be the base curves 

of a marking on Z. For each 7, we define its transversal f(7,) to be vr 7 ,(/i). This 
process yields a complete marking on Z, since there are £(Z) base curves and each has 
a transversal; work of Masur and Minsky [43, Lemma 2.4] shows that any complete 
marking uniquely defines a finite subset of complete clean markings of diameter at 
most 6. Thus this construction yields a bounded diameter subset of Ai(Z), but arbitrary 
choices in choosing the 7; were made along the way; we define ttm{Z){h) to be the 
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union of all possible markings built following this process, in the following lemma we 
show that this set has bounded diameter. 

Lemma 6.1 There exist a uniform bound depending only on S, which bounds the 
diameter of dmmM(Z)(^M(Z)(^)) for any \i G M(S) and Z C S. 

Proof Fix fi G M(S), Z C 5, and u, v 1 G ttm(Z)(^)- It suffices to consider u with 
base curves, 7,-, and transversals, w^Qi), as given above since [43, Lemma 2.4] implies 
that each such complete marking has a bounded diameter subset of M.{Z) associated 
with it (similarly for v 1 ). We consider the projections of v into C(Y) for each Y C Z and 
show that the image under these projections are close to 7Ty(/i). We proceed inductively, 
by considering the smallest i for which iryiji) 7^ 0- 

If i = 1 , then since de(Z)(7i> A*) < 1 > Lemma 3.3 (Lipschitz projection) implies that 
dc(Y)(l\, < 3 for any Y C Z. Thus we obtain dc(Y)(v, p) < 3. 

We now assume the inductive hypothesis that for a fixed n we have dc(Y){v, /x) < 3 
for all F C Z with 7ry(7,) 7^ for some i < n. Our goal is to show that if n is the 
smallest index of a curve 7„ G base(z^) with 7Ty(7„) 7^ 0, then dc(Y)(v, ff) < 3. Since 
7,- n F = when i < n, we have dc/(y)(7r y (/i), vr' y (7„)) < 1 . Thus by Lemma 3.3 we 
have dc(Y)(KY(if) , ^7(7,,)) < 3, and hence dy(fi, u) < 3. Thus for any surface Y C Z 
with 7ry(base(^)) / we have dy(/i, z/) < 3. 

It remains to deal with the case where Y is an annulus with base curve 7 G base(^), but 
here it follows from the definition of itm(Z){p) that tty(h) = ttf(^), thereby proving 
that for any v G ttm(Z){^) and any Y C Z we have dy(/U, z/) < 3. This implies that and 
any Y C Z we have diamy(zv, 1/) < 9. 

Now, Theorem 3.17 shows that dM(Z)(v, ^0 ~ Yl ycz dy(v, v'), but we just showed 

that when the threshold is taken larger than 9 then the sum on the right side is zero. 
Thus if we choose a threshold t > 9 in Theorem 3.17, then dM(Z)(v, v') is bounded by 
the additive quasi-isometry constant in the conclusion of that theorem. □ 

As in the case of the maps itz, we abbreviate dM(Z){^M(Z){^i), ^M(Z)i^i)) by writing 
dM(Z)(Hi, vi). 

Definition 6.2 Define a pair of points (fi, u) in Ai^iS) x Aiuj(S) to be of bounded 
growth if there exists a constant D for which: 

limsup{ < i M(Z) (/i i -, : Z C 5} < D < 00. 

For our fixed basepoint G A4(S), let Fq denote the set of v such that the pair (0, v) is 
of bounded growth. We use the term strongly bounded to refer to the elements of Fq . 
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Remark 6.3 Fixing a sequence of scaling factors d n and an ultrafilter oj, one can 
consider those sequences D(n) for which lim w = : such sequences are said to 
have sublinear growth ( with respect to the scaling constants d n and ultrafilter uj ). This 
definition allows one to consider, for instance, the set of points p, € Aiuj(S) for which 
the sequence {fi?x(Z)(/^i 5 0) : Z C 5} grows sublinearly, we call these points strongly 
sublinear (for the choices d n and lu ). (A special case of this is for the scaling constants 
d n = -, indeed this class of asymptotic cones was used in many of the original papers 
in the subject, eg, [23, 25, 18, 35].) 

We note that the remaining results and proofs of this section hold mutatis mutandis 
replacing "strongly bounded" by "strongly sublinear." This is often useful, but in the 
sequel we use the strongly bounded hypothesis in order to have a condition which holds 
in all asymptotic cones simultaneously. 

Example 6.4 If p G M.CQ(S) is pseudo-Anosov, then the sequence (p") n has distance 
from the origin, , growing at a linear rate. At the same time, there is a uniform bound 
M so that for all n <E N and Y C S one has d Y (p", 0) < M. This shows that |F | > 1 . 
This is an important example, as in this case, up to some quasi-isometry constants, all 
the mapping class group distance is accounted for by distance in the complex of curves 
of the surface S. 

This notion of degrees of boundedness provides an interesting stratification of M.^ ; 
Theorems 6.5 and Theorem 7.1 are examples of how this stratification provides 
information about the mapping class group. These results are refined further in 
Behrstock-Minsky's proof of the Rank Conjecture [6]. 

In Section 5 we used projections from M.(S) to hierarchy paths and showed that these 
maps had a strong contraction property. In fact, these maps were sufficiently contracting 
so that for each hierarchy path, in the asymptotic cone we obtained a locally constant map 
from M U (S) to a bi-Lipschitz path. From this we concluded that in the low complexity 
cases Ai^S) is an R-tree. This argument relied heavily on the fact that these surfaces 
had low topological complexity, nonetheless we extend the essential parts of that 
argument to produce strongly contracting maps in mapping class groups of surfaces of 
arbitrary complexity. In so doing, the following result shows that bi-Lipschitz flats in 
M-u, intersects Fq in a very simple way. This answers and extends Conjecture 6.4.6 
of [3]. This contraction property also shows that given a pseudo-Anosov element <ft 
and sufficiently large n G N, then any path in M.(S) \ B|^2n|(l) has length which is at 
least quadratic in | (f) 2 " \ . This holds since the contraction property shows that linear 
length paths in this space move at most a uniformly bounded amount when projected 
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to the quasi-geodesic from <j)~ n {\) to (f> n (l), therefore any path in A4(S) \ B^rfl) 
connecting <fi~ (1) to (1) must contain at least linearly many disjoint linear length 
subpaths. Except when it is virtually free, Ai(S) is thick of order 1 [4] and thus has at 
most quadratic divergence. Hence, except in those cases where it is virtually free, the 
mapping class group has a quadratic divergence function. See the next subsection for 
some further topological corollaries and a discussion of other applications. 

Theorem 6.5 Fo satisfies the following properties: 

(1) Fq is geodesically convex. 

(2) Any distinct pair of points in Fq are connected by a unique path in M. w . In 
particular, Fo is an M-tree. 

(3) In M. w , any bi-Lipschitz flat of rank larger than one intersects Fq in at most one 
point. 

Before proving this theorem we establish some notation and provide a lemma which we 
shall need in the proof. 

Similar to the definition in Section 5, for a hierarchy G we define the large domains of 
G as: 

Gg,c = {Y C S : d Y (I(G), T(G)) >C> 3M}. 

We will drop the C from the notation when the choice of C is irrelevant. Also, we 
define the initial large domains by: 

I G = {Y G Q G : Each Y' G Q which overlaps Y has Y < t Y'}. 

From finiteness of Q it follows /(; / 0; since any two domains of Iq are disjoint or 
nested there is a bound on \Iq\ depending only on the topological type of S. 

Lemma 6.6 If Y' G Qg \ Ig then there exists Y G Iq such that Y' overlaps Y . 

Proof Letting Y' ^ Ig, then there must exist a domain Y\ G Qq which overlaps Y' 
and for which Fi ^ t Y' . Either Y\ G Ig in which case we are finished, or we can find a 
surface Y2 G Qg which overlaps Y\ and for which Y<i -< t Y\. Since Y\ G Q , Lemma 3. 16 
implies that dy s (dY', dY2) > M and thus Y' must overlap Y2. Repeating this process 
finitely many times (since Qg is finite) yields a sequence Y -< t . . . -< t Y2 -< t Y\ ~< t Y' 
with Y G Ig and Y overlapping Y' . □ 

We now proceed to the proof of Theorem 6.5. 
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Proof Fix two points x = (x,),y = (y,-) G Fq. Fix a hierarchy Hj connecting x,- to 
yt and let denote the coiTesponding hierarchy path between them. We write 

[x,y] to denote the bi-Lipschitz path in M-^S) obtained by taking an ultralimit of these 
quasi-geodesic paths. 

(1) We will now show that every point of [x,y] is in Fo. By the definition of Fq, for 
every (a,-) G Seq \ (S) we have 

lim(max{d M{ai) (xi,0),d M(cii) (yi,0)}) < oo. 
Thus, in particular, for each (a,) G Seq \ (S) we have 
(6-1) lim —dM(ai)(xi,yi) = 0. 

We note the following useful property of hierarchy paths: if [p, u] is a hierarchy path 
from p to v and p G [/z, i/] , then for all Y C 5 one has dc(Y)(v, p) < dc(Y)(v, p) + 3M 
(where M is the constant of Theorem 4.3, depending only on S). Invoking Theorem 3.17 
and Lemma 6.1 this implies that dM(Y)(v, p) < KdM(Y)(v, p) + C, for constants K and 
C which depend only on S. 

By definition of the asymptotic cone, if (w ; ) G [x, y] , then for an w-large subsequence 
of indices the distances ^^(w;, [x,-,y,]) grow slower than the sequence d n ; as a 
consequence of the previous paragraph we see that for any (w,) G [x, y] and any 
(a;) G Seq \ (S) we have ^x(a,)(w,-,y,-) < KdM(ai){xuyd + C for ^-almost every 
i. Together with equation 6-1 this implies that for any (a,-) G Seq \ (S) we have 
lim^ 4 dM{ai)(yi, W/) = 0. Namely, we have shown that [x,y] C Fo, ie, Fo is 
geodesically convex. 

(2) This argument follows the general outline used to prove Lemma 5.6. Consider 
the family TL consisting of ultralimits of rescaled hierarchy paths connecting pairs of 
points in Fo . We shall show that for any x, y G Fo and any path [x, y] G H there is a 
continuous map <I> : M W (S) — > [x,y] such that: 

(1) $ | [ Xj y] is the identity 

(2) $ is locally constant outside [x, y] . 

Once this is established Theorem 5.7, which we previously used in conjunction with 
Lemma 5.6 to prove Theorem 5.1, implies that Fo is an M-tree. The two key differences 
between this argument and that of Lemma 5.6 are in the way in which the map $ is 
constructed and then the proof that this map is locally constant — since the present 
construction of $ works for surfaces of arbitrary complexity we must rely on the 
property of being strongly bounded for both these steps. Similar to the construction of 
$ in Section 5, we define our projection by considering a sequence of hierarchies and 
maps to subsets of their slices and then perform a limiting procedure. 
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<I> is well-defined 

Fix fi G M.u and let gE, C C(S) denote the main geodesic of Hi. By 5-hyperbolicity 
of C(S), a la Lemma 2.3, the closest point projection of 7Ts(//,) to g//, determines a 
subset C Mi C gHj whose diameter is uniformly bounded by a constant K(5), which 
depends only on the 5-hyperbolicity constant of C(S). Define <£(/x,) to be the set of all 
marking in [x^y,] whose base curves include any element 7 G C Ml . 

Note that given any element 7 G C^, , by strong boundedness there are at most a 
bounded number of elements of £(7) which appear as transverse curves to 7 in some 
marking of Hi . Further, strong boundedness also implies there is a uniform bound on 
diam^sy^fjx^y;]). Thus, inductively building the subset C [x,-,y,-] one sees that 
there is a uniform bound, K, on its diameter. 

In particular, since for each /i, G [x^y,] one has /i,- G 3>(/i,), the above uniform bound 
implies that diamx^(($(^,)), (//,}) = lim w ^CK") = 0. Thus one obtains that <£(^) is 
always a point, this also shows that restricts to the identity map on [x,y] . 

§ is locally constant off [x, y] 

Fix a point /i G .M ^(5) \ [x, y] and a point // G with af(/x, fi') < c ■ dip, 

for a positive constant c, depending only on the topological type of S, to be determined 
by the following argument. We want to show that for any such pair one obtains 
$(/*) = $(//)• 

Fix a representative for /i = (/x,-) and for each i consider a hierarchy G,- whose initial 
marking is a point of $(/x ; ) and with terminal marking ^t,- — this involves an arbitrary 
choice of element of 3>(/x,-), but the fact that diam( <£(//,•)) is uniformly bounded implies 
that any choices here yield the same point in the asymptotic cone. As usual, gG, will 
denote the main geodesic of G, . 

The remained of the argument will be divided into two cases depending on the length 
of gGj- Let K denote a fixed constant coming from Theorem 3.17 (K is chosen by 
taking the quasi-isometry constant given by Theorem 3.17 corresponding to a threshold 
t > AM + D, where M is the constant given by Theorem 4.3 and D is the constant 
quantifying the strong boundedness of the pair (x,y), as given in Definition 6.2). 

Case (i) 

\go\ > j(d(n, $(/u)) More precisely, here we assume that for u;-almost every i the 
length of g G , is larger than ^d M (S)(W, $(>;))• 
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Let c < . Hyperbolicity of C(S) , then implies that there is a uniform bound on 
diamc (s)(C([ii) U C(/x-)) (see Theorem 3.2 and Lemma 2.3). The proof that $ is well 
defined, then implies that <£(/!) = <£(//)■ 

Case (ii) 

\gc\ < jcd(n, We now consider the cases not covered by Case (i), namely we as- 

sume that for u-almost every i the length of go, is at most 
^d.M{S)(V>u ^(/-t/))- The choice of K implies that for w-almost every i, any hierarchy 
between /i/ and must have a domain Y C S for which c/y(/i,-, 3>(/x,-)) > 4M + D, 

where M is the constant given by Theorem 4.3 and D is the strongly boundedness constant 
for the pair (x,y). Fix a constant < c < 1 and let /i, = /j^o, /if i, /ii,2, • ■ • 5 Mi,* = M; be 
an elementary move sequence of a hierarchy path in M(S) with & < cdM(S)(Hii $(A*/))- 

For any surface F C 5, define 

= e [0, *] : drimj, A*i) > 4M + D and d y ()U iV , $(//,■)) > 4M + D}. 

We now give a brief digression to study properties of y . Suppose j G /; y , Z G z , and 
y , Z C 5 overlap. Then, Y and Z are both large domains for G,- , and in the time ordering 
on the hierarchy G, either Y -< t Z or Z -< t Y , for the remainder of the paragraph 
we assume the first ordering (in the other case an analogous argument can easily be 
provided by the reader). Lemma 3.16 shows that Y ^ t Z implies dy(dZ,fii) < M 
and dz(dY, <3?(/i,)) < M; the first inequality combines with the definition of /,- y to 
imply that dyiwj, dZ) > 3M + D. Theorem 4.3 then implies that dz(fJ-ij, BY) < M, 
which combines with the second inequality of the previous sentence to show that 
dzi^ij, < 2M. Thus j Ji z and we have shown that for any y, Z C S we have 

h,Y H Ji : z = 0- Thus /,- y and J, z share a point only if they are disjoint or nested, a 
straightforward topological count gives a bound of 2£(S) for the number of /,- y which 
can cover a given j. Furthermore, this argument shows that in any hierarchy with 
initial marking /i,-, then either dc(Y)(l^i, Mij) < 2M for all y' or the projection to C(Z) of 
the terminal marking of this hierarchy is within M of the point 3>(/i,-) ; when the later 
condition is satisfied we say that this hierarchy has traversed Z. 

What we shall now show is that the hierarchy from /x,- to /i- can not possibly traverse 
all the large domains appearing in the hierarchy G, . This then implies that there exists 
a large domain of G/, which we will call Y-, in which de(Yi)(fJ', M/) < 2M. The strongly 
bounded hypothesis implies that diamc(y i ?)([x i -,.y,]) < ^, together with the fact that 
dc(Y!)(fJ', ^(Mi)) > 4M + D this allows us to conclude that the main geodesic of any 
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hierarchy from //• to [x,-,y,] must pass through the vertex dY\ G C(S). From this and 
the fact that [xi,yi] C Fo we then give a quick deduction of the theorem. 

Recall that Gq,4m+d denotes the set of domains in G, supporting geodesies larger than 
4M + D ; unless noted otherwise all time orderings on domains will refer to the ordering 
in the hierarchy G, . Recall that the set of large initial domains for G, has cardinality at 
most 2£(S). Thus, by Lemma 6.6 there exists Y- C S with Y\ G Ggi,4m+d, such that if 
we define 

X = {Y[} U {Z G a G ,,4M+D : ^ -< f Z}, 
then for w-almost every z we have ^ dc(z)(p>i, 3>(/-t;)) > aG?M(S)(Mi; ^(/^r))* f° r a 
fixed a > independent of i. 

Since the /, z cover the path from ^, to /i • with degree at most 2£(S) , we have 2k£(S) > 
Ylzeij \ J',z\ ■ Now we use the result from [43, page 962] that there exists a constant 
(3 > (depending only on the topological type of 5) for which \J^z\ > (3dc(Z)(^i, ^(/J;)) 
if the path from /i; to ^- traverses the large domain Z. Thus, if all the domains Z G X 
are traversed between /x,- and /i ■ , we have 

2k&S) > \Ji,z\ > £}/3<fc(Z)(/*i,*0i/)) ^ a/3dM(S)(»iM»i))- 
zeii zeii 

Thus, k > ^ws^Mv,)) . jajjjjjg c < we obtain a contradiction, and hence we 
may assume that not all the large domains in X are traversed between m and /x • . 

In particular, we may assume Y\ can not be completely traversed between /x,- and /x- and 
is thus a large domain for any hierarchy from /i- to 3>(/i;). Therefore, the main geodesic 
of any hierarchy from /x- to <£(j(x,) contains the vertex dY[ . If dc{S)(dY' h <3?(/x,)) > 5, 
then any C(S) geodesic from vrc(s)(^-) to the main geodesic of //, must pass within 5 
of dY'j . This yields a uniform bound on diam(C M , U C^<) and the same argument given 
before shows that in this case <£(/x) = ^(/x')- 

Thus, we may assume that the only choices for Y\ are those which satisfy 
dc(s)0Y'j, $(/x;)) < <5. It then follows from the Move distance and projections Theorem 
that for u-almost every i there exists a domain <2, for which dc(s){dQi, ^(/i/)) < 5 and 
with lim^ dc(Qi){lJii, $(/*/)) — > oo. On the other hand, the strong boundedness of the pair 
implies that diamc(Q i )([x/,y ( ]) < D for cj-almost every i, hence by Lemma 3.14, 
any hierarchy between /x- and any point on [Xi,yi\ must have a geodesic supported on 
Qi. Thus the main geodesic of any hierarchy between /x- and any point on [x;,y ; ] must 
pass through the point dQi . This provides a uniform bound on the distance between 
$(jm) and and therefore shows that <£(/i) = , completing the proof. 
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(3) Any bi-Lipschitz flat which intersects Fq in more than one point contains an arc 
of Fq . If this flat has rank greater than one, then this arc must be part of an embedded 
circle. But this would contradict the fact that there is a unique path in M. u between 
each pair of points of Fq . 

Thus a bi-Lipschitz flat of rank greater than one can intersect Fq in at most one point. □ 

7 Cont^iMCQ) is tree-graded 

We begin by stating a consequence of Theorem 6.5 and use this as a segue to mention 
some interesting questions about the mapping class group. In Theorem 6.5 we obtain 
a unique path in Ai u between any pair of points in Fq. From this it follows that the 
removal of any point of Fo disconnects M.^, ie, if S is any surface and M^S) is any 
asymptotic cone of MCQ{S), then M-uj(S) has a global cut-point. Since asymptotic 
cones of groups are homogeneous, this can be stated as: 

Theorem 7.1 If S is any surface and M^iS) is any asymptotic cone of MCQ(S) , then 
every point of M. U {S) is a global cut-point. 

We now mention the following definition which was introduced in [20]. 

Definition 7.2 Let F be a complete geodesic metric space and let V be a collection 
of closed geodesic subsets (called pieces). The space F is said to be tree-graded with 
respect to V when the following two properties are satisfied: 

• The intersection of each pair of pieces has at most one point. 

• Every simple geodesic triangle in F is contained in one piece. 

In Remark 2.31 of [20], it is noted "every asymptotic cone of a group which has a 
cut-point is tree graded with respect to a uniquely determined collection of pieces each 
of which is either a singleton or a closed geodesic subset without cut-points." In light 
of this remark, Theorem 7.1 implies: 

Corollary 7.3 For every surface S, each asymptotic cone, M^iS), is tree-graded. 

This result raises the question of describing the pieces. A useful object in the study of 
tree-graded structures is the transversal tree containing a point fi : this is defined to be 
the maximal tree (possibly consisting of only the point n), satisfying the property that it 
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intersects each piece in at most one point. One consequence of Theorem 6.5 is that Fq 
is a subset of the transversal tree containing the basepoint 0. (Similarly by defining 
for any [i G Aiu(S), we obtain a subset of the transversal tree containing the point fi.) 

One of the questions solved by [20] is a proof of quasi-isometric rigidity for groups 
whose asymptotic cones admit tree-gradings of a certain form, namely the asymptotic 
cones of their peripheral subgroups are cut-point free (see [4] for a generalization). 
This approach might shed light on the following, which remains a major outstanding 
conjecture in the study of the mapping class group: 

Conjecture 7.4 Let S be a non-exceptional surface of finite type. For any finitely 
generated group G quasi-isometric to the mapping class group MCQ{S), there exists a 
homomorphism G — ► M.CQ(S) with finite kernel and finite index image. 

Informally, this question asks if the mapping class group is uniquely determined among 
finitely generated groups by its large scale geometry. (See [47] for a further discussion 
and a proof of Conjecture 7.4 for mapping class groups of once punctured surfaces.) 

Another interesting question is to determine: to what extent are mapping class groups 
relatively hyperbolic. There are several competing definitions for relative hyperbolicity, 
one of which is that the Cayley graph "electrified" over the cosets of a finite collection of 
subgroups is 5 -hyperbolic, this is now often called weak relative hyperbolicity. Another 
is weak relative hyperbolicity combined with a hypothesis called the Bounded Coset 
Penetration Property (BCP), which places restrictions on how paths can travel between 
these cosets; this condition was first formulated by Farb [21] and is often called (strong) 
relative hyperbolicity. Many people have since studied another equivalent versions of 
relative hyperbolicity which was formulated by Gromov [24] and then elaborated on by 
Bowditch, Osin, Szczepahski, and others [10, 49, 52]. 

Another main result in [20] is that a group, G , is relatively hyperbolic with respect to 
a collection of subgroups, H\ , . . . , H n , which satisfy BCP if and only if Cone w (G) is 
tree-graded with respect to ultralimits of cosets of the subgroups //, . They ask the open 
question of whether any group whose asymptotic cones have cut-points is relatively 
hyperbolic (See Problem 1.18 of [20]). Masur and Minsky showed that MCQ(S) is 
weakly hyperbolic relative to stabilizers of multicurves [42] , thus one knows that the 
maximal subsets without cut-points in M^iS) are not ultralimits of these subgroups. 
It is easy to verify that multicurve stabilizers do not satisfy BCP, nonetheless, there 
could be another collection of subgroups with respect to which the mapping class group 
is relatively hyperbolic. Accordingly, in an earlier version of this paper we asked the 
fundamental question: 
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Question 7.5 Is there a collection of subgroups of AiCG(S) with respect to which this 
group is strongly relatively hyperbolic? 

An answer to this question can now be given. The first author in joint work with Druju 
and Mosher [4] and independently work of Anderson, Aramayona, and Shackleton [1] 
gives a complete answer to this question by showing that for no family of subgroups is 
the mapping class group strongly relatively hyperbolic. 

Corollary 7.3 contrasts with the result of [4] and [1] to yield an example of a group 
whose asymptotic cone is tree-graded, but the group is not strongly relatively hyperbolic 
for any choice of subgroups; this answers Problem 1.18 of [20] and does so with a 
finitely presented group. In fact, in [4] a stronger result is shown, namely, there is no 
collection of subsets of the mapping class group for which Cone w (M.CQ) is tree-graded 
with pieces coming from ultralimits of these subsets; thus the result that Cone^iAiCG) 
is, nonetheless, tree-graded answers another question of [20] concerning whether pieces 
in the asymptotic cone always arise as ultralimits of subsets. 

We note the following question which this raises. 

Question 7.6 What other groups besides M-CQ(S) have tree-graded asymptotic cones, 
but are not strongly relatively hyperbolic with respect to any subgroups? . . . subsets? 

After learning of our methods, B. Kleiner informed us that using analogous methods 
one can show that graph manifolds satisfy the property of having tree-graded asymptotic 
cones, but are not strongly relatively hyperbolic with respect to any subgroups [38]. 
(See [36] for a related discussion. See also [4] for a proof of the latter fact.) 

The results about asymptotic cones throughout this work have all been independent 
of the choice of ultrafilter. In [40] it is shown that if one assumes the negation of the 
Continuum Hypothesis, then there are uniform lattices in certain semi-simple Lie groups 
with 2 2 " different asymptotic cones up to homeomorphism (here uj denotes the first 
uncountable ordinal). Even in the presence of the Continuum Hypothesis, [20] produce 
examples of finitely generated groups with 2 W non-homeomorphic asymptotic cones. 
These results make natural the following two questions. 

Question 7.7 

(1) How many non-homeomorphic asymptotic cones does the mapping class group 
have? 

(2) (Sapir) How many non-bi-Lipschitz equivalent asymptotic cones does the 
mapping class group have? 
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Note that for the examples given in [20] of finitely generated groups with 2^ non- 
homeomorphic asymptotic cones, the fundamental group can be used to distinguish 
the different asymptotic cones. Producing non-homeomorphic asymptotic cones for 
the mapping class group would require more subtle techniques since automaticity of 
the mapping class group implies that every asymptotic cone for this group is simply 
connected — this follows from results in [50] and [48]. 
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